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Since  the  first  edition  of  this  book  sea  published  (1961)*  the  amount  of 
literature  devoted  to  the  theory  of  optimal  .madid  reception  methods  Has  increased 
by  more  than  a  factor  of  5  and  a  multi-volume  work  mould  be  required  for  a 
aufficiently-complete  exposition  of  the  contemporary  stats  of  this  theory.  Therefore, 
this  book  should  be  looked  upon  only  ae  an  introduction  ho  the  overall  theory 
of  opciiifiii  rsoio  recaption  iwtnooB  • 


Compared  to  the  first  edition ,  this  book  includes  four  additional  chapters 
20,  21,  and  22)  covering  optimal  nonlinear  filtration,  special  features  of 
tyntbasia  of  optimal  receivers  when  incomplete  a  priori  inforaation  is  available 
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(use  of  the  theory  of  statistical  solutions,  game  theory,  the  theory  of  nonparametric 
methods  of  statistics,  and  the  theory  of  stochastic  approximations),  and  situations 
where  more  than  one  quality  indicator  must  be  examined.  Also,  several  minor  changes 
have  been  made.  In  particular,  about  10%  of  the  material  devoted  to  relatively- 
secondary  problems  has  been  eliminated  to  avoid  a  significant  increase  in  the 
size  of  the  book. 

The  following  sequence  is  used,  as  was  the  case  in  the  first  edition,  to 
simplify  comprehension  of  the  material  and  to  approximate  actual  stages  of  development 
of  this  theory. 

Some  preliminary  remarks  required  for  understanding  subsequent  material  are 
contained  in  Part  I. 

Part  II  is  devoted  to  presentation  of  V.  A.  Kotel'nikov's  theory  of  potential 
noise  immunity  and  analysis  of  certain  assumptions  in  this  theory  from  the  point 
of  view  of  the  present  state  of  the  problem.  Kotel'nikov  assumed  that  all  signal 
parameters  are  precisely  known  at  the  point  of  reception  (with  the  exception  of 
the  message  subject  to  reproduction).  Therefore,  Part  II  in  essence  is  devoted 
to  optimal  reception  of  precisely-known  signals. 

Optimal  reception  of  signals  comprising  several  unknown  (random)  parameters,  /8 
along  with  a  useful  message,  is  examined  in  Part  III.  The  analytical  method  used 
here  in  essence  is  a  direct  offshoot  of  the  Kotel'nikov  method. 

Part  IV  is  devoted  to  analysis  of  optimal  receivers  and  their  properties 
using  modem  mathematical  methods  of  statistics.  Consequently,  if  Part  III  differs 
from  Part  II  mainly  by  the  type  of  signals  examined,  then  Part  IV  differs  from 
the  other  parts  mainly  by  the  analytical  methods  used.  The  methods  presented 
in  Part  IV  are  general  and  moreover  make  it  possible  to  use  vast  results  obtained 
in  mathematical  statistics  to  find  optimal  receivers  and  analyze  their  properties. 

As  was  the  case  in  the  first  edition,  the  author  strived  wherever  possible 
to  simplify  the  presentation  and  make  it  accessible  to  a  wide  range  of  radio 
specialists  and  senior  students  in  higher  educational  institutions. 
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PRELIMINARY  INFORMATION 

CHAPTER  ONE.  INTRODUCTION 

1.1  Problem  Formulation.  Brief  Historical  Sketch 

The  problem  of  noise  immunity,  i.  e.,  the  problem  of  finding  the  best  methods 
of  radio  signal  reception  in  noise,  is  the  fundamental  and  most  complex  radio 
reception  problem.  This  is  explained  by  the  fact  that  the  amount  of  noise  increases 
simultaneously  with  receiver  improvement,  and  signal  reproduction  requirements 
levied  also  increase. 

Optimal  reception  methods  are  those  which  provide,  in  a  certain  sense,  the 
best  reception  of  signals  or  the  messages  they  carry  when  noise  is  present.  That 
which  is  included  in  the  concept  of  "best1*  is  referred  to  ss  optimization.  Minimum 
mean  square  error,  minimum  complete  error  probability,  and  so  forth  may  be  applicable 
criteria. 

In  many  instances,  the  purpose  of  the  receiver  is  to  reproduce  only  the  message 
with  which  this  signal  is  modulated,  rather  than  the  entire  signal.  Unless  stipulated 
specially  otherwise,  the  theory  is  constructed  for  modulated  signals  since  unmodulated 
signals  may  be  considered  individual  cases  of  modulated  signals.  Here,  the  basic 
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assumptions  and  results  of  the  theory  of  optimal  signal  reception  methods  turn 
out  to  be  valid  for  the  most  diverse  signal  types— modulated  and  unmodulated, 
radio technical,  acoustical,  wire  communications  signals,  and  so  forth. 

Therefore,  the  terms  reception  and  receiver,  rather  than  radio  reception 
and  radio  receiver,  will  be  used  in  future  for  commonality.  However,  the  basic 
examples  will  be  taken  from  radio  reception  since,  in  this  instance,  signals  and 
noise  usually  have  the  most  complex  nature  and  protection  against  noise  is  the 
most  pressing. 

From  a  mathematical  point  of  view,  the  task  of  finding  the  optimal  receiver 
boils  down  in  general  terms  to  the  following. 


However,  noise  (prior  to  its  arrival  at  the  receiver)  generally  may  distort 
the  signal  in  a  more  complex  manner,  modulate  one  or  several  signal  parameters 
(amplitude,  phase,  frequency,  and  so  on),  for  example.  Therefore,  generally 


expression  (1.1)  only  denotes  that  oscillation  y(t)  arriving  at  receiver  input 
is  some  function  of  signal-noise  oscillations.  Ue  will  use  ^(t)  to  denote  an 
oscillation  at  a  receiving  device  output. 

We  must  find  that  receiver  structure,  i.  e.f  that  law  of  conversion  of  y(t) 
into  'f (t)  in  which  If  (t)  will  reproduce  message  x(t)  the  best  (in  a  certain 
sense).  Optimization  must  be  formulated  mathematically  to  solve  this  problem 
and  some  signal-plus-noise  y(t)  characteristics  must  be  given,  statistical  charac¬ 
teristics,  for  instance. 

The  following  basic  problems  are  solved  in  the  theory  of  optimum  reception 
methods. 

1.  Selection  of  and  foundation  for  the  corresponding  receiver  optimizations. 

2.  Development  of  theoretical  methods  of  finding  the  optimal  receiver  structure 
i.  e.,  receivers  satisfying  the  selected  optimization. 

3.  Investigation  of  the  properties  of  optimal  receivers,  primarily,  evaluation 
of  their  noise  immunity,  i.  e.,  determination  of  the  minimum  signal  to  provide 

the  given  signal  or  message  reproduction  quality. 

4.  Comparison  of  optimal  receivers  determined  theoretically  with  extant 
receivers  (i.  e. ,  those  already  in  use)  to  find  the  possibilities  for  and  the 
advisibility  of  increasing  the  noise  immunity  of  extant  receivers. 

5.  Comparison  of  optimal  receiver  noise  immunity  for  various  known  types 
of  signals,  in  particular  for  various  known  methods  of  modulating  these  signals, 
to  select  the  most  noise-immune  types  of  signals  and  methods  of  modulation. 

6.  Finding  new,  optimal  signal  types  (modulation  methods)  which  provide 
the  greatest  noise  immunity  as  optimal  receivers  receive  these  signals. 

Problems  of  finding  and  investigating  the  most  noise-immune  systems  are  examined 
not  only  in  the  theory  of  optimal  reception  methods,  but  in  the  general  theory 
of  communications  as  well.  We  will  explain  the  link  between  these  theories. 


Figure  1.2.  (a)  —  Message  source;  (b)  —  Cotter;  (c)  —  Transmitter; 

(d)  --  Communications  line;  (e)  —  Receiver  with  decoder; 

(f)  —  Recipient;  (g)  —  Noise. 

A  communications  system  (Figure  1.2)  rather  than  an  isolated  receiver  is 
examined  in  the  general  theory  of  communcations.  Primary  (useful)  message  x(t) 
is  coded  into  secondary  message  x'(t),  which  modulates  a  transmitter.  The  /II 
modulated  signals  arrive  via  a  communications  line  (radio  path,  wire,  and  so  on) 
to  the  receiver. 

Without  noise  and  other  signal  distortions,  the  oscillation  at  receiver  input 
would  have  the  form 

0(<)  -«,.(<).  (1*3) 

where  x'  =  x'(t)  — -  distortionless  coded  message  x(t). 

Given  ideal  receiver  operation,  after  demodulation  and  decoding,  oscillation 
-y(t)  equalling  or  proportional  to  the  primary  message*  would  be  isolated: 

y  (t)  s  const  x(t).  (1.4) 

However,  noise  may  be  active  throughout  the  entire  communication  system  tract 
(Figure  1.2).  It  may  cause  distortion  of  secondary  (encoded)  message  x'(t)  and 
lead  additionally  to  the  fact  that  oscillation  y(t)  at  receiver  input  will  equal 
a  complex  function  from  uv,(t)  and  noise  um(f)  ,  rather  than  u  ,(t). 

X  A 


*For  brevity,  the  assumption  here  is  that  the  receiver  simply  is  to  reproduce 
the  message. 


Therefore,  oscillation  "J (t)  at  receiver  output  differs  from  (1.4),  i.  e., 
the  receiver  will  reproduce  message  x(t)  with  distortions. 

The  problem  will  be  formulated  in  the  following  way  in  the  general  theory 
of  communications:  find  those  communications  system  unit  (coder,  transmitter, 
communications  line,  and  receiver)  construction  principles  providing  the  best 
(in  a  certain  sense)  message  transmission  from  source  to  recipient  when  noise 

is  present. 

Consequently,  in  the  general  theory  of  communications,  one  seeks  the  optimum 
with  respect  to  all  permissible  signal  types  (including  all  possible  types  of 
oscillation  carriers  and  modulation  and  coding  methods)  and  with  respect  to  all 
possible  methods  of  reception. 

In  the  theory  of  optimal  reception  methods,  one  will  seek  the  optimum  primarily 
with  respect  to  all  possible  methods  of  reception  for  the  selected  (usually 
sufficientlv-broad)  class  of  signals.  Having  obtained  the  corresponding  results, 
it  is  then  possible  to  find  the  optimum  with  respect  to  all  possible  types  of 
oscillation  carriers  and  modulation  methods  (within  the  range  of  the  selected 
signal  class),  assuming  here  that  the  coding  method  is  unchanged  or  generally 
assuming  that  there  is  no  coding. 

This  problem  was  posed  and  successfully  solved  in  several  works,  beginning 
with  those  of  V.  A.  Kotel'nikov  [1]  and  F.  M.  Woodward  [2]. 

Using  the  theory  of  optimal  radio  reception  methods,  it  is  possible  to  take* 
the  final  step — to  find  the  optimum  also  with  respect  to  message  coding  methods 
[182,  183].  Here,  the  problems  that  the  theory  of  optimal  reception  methods  solves 
in  essence  converge  with  the  problems  of  the  general  theory  of  communications. 

The  general  theory  of  communications  evolved  until  the  last  decade  mainly 
based  on  the  classic  theory  of  information.  K.  Shannon  [185]  developed  the  basic 
assumptions  for  this  theory.  Its  main  drawback  is  that  it  did  not  consider  the 
importance  of  information  for  the  problem  posed.  In  particular,  this  hindered 
use  of  the  classic  theory  of  information  for  radar  problems. 


On  the  other  hand,  problems  of  optimal  message  coding  and  information  trans¬ 
mission  rates,  being  fundamental  problems  in  the  general  theory  of  communications 
(transmission  rate  was  examined  only  indirectly  by  limiting  time  T  devoted  to 
message  reproduction)  were  not  addressed  in  the  theory  of  optimal  radio  reception 
methods  until  the  last  decade. 

Therefore,  until  the  last  decade,  the  theory  of  optimal  radio  reception  methods 
and  the  theory  of  information  were  two,  virtually  nonoverlapping,  sections  of  the 
general  theory  of  commutations.  However,  in  recent  years,  several  works  have 
appeared  in  which  ideas  lying  at  the  foundation  of  both  theories  are  addressed. 

Here,  there  has  been  a  significant  measure  of  success  in  surmounting  limitations 
inherent  in  each  of  these  theories,  i.  e.,  in  considering  the  importance  of  infor¬ 
mation  [133,  184,  and  others]  and  in  using  the  theory  of  optimal  reception  methods 
for  problems  in  message  coding  method  optimization  [133,  182,  183,  186,  187,  and 
others].  Therefore,  at  present  no  aufficiently-distinct  boundary  yet  exists  between 
the  theory  of  optimal  reception  methods-  and  the  theory  of  information  and  it  is 
possible  that,  during  the  next  decade,  they  will  merge  completely  into  a  single 
theory  of  optimal  message  transmission  and  reception  methods. 

Now  we  will  refine  what  we  mean  by  the  term  receiver.  If  a  receiver  operates 
within  an  information  transmission  system  (communications  system),  the  Figure 


Figure  1.3.  (a)  —  Message  source  (object);  (b)  —  Radio 

wave  propagation  channel;  (c)  —  Receiver;  (d)  —  Recipient; 

(e)  —  Noise;  (f)  —  Transmitter. 

1.2  schematic  describes  its  role.  In  the  case  of  radar  systems,  the  typical  schematic 
is  the  one  depicted  in  Figure  1.3.  The  message  source  is  an  object,  whose  parameters 
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(coordinates,  speeds,  and  so  on)  must  be  determined.  They  are  extracted  by  the 
receiver  from  the  radio  signal  radiated  or  reflected  by  the  object  (by  an  aircraft, 
space  vehicle,  and  so  on).  In  several  instances,  a  special  radio  transmitter 
painting  the  object  is  required  to  shape  this  radio  signal. 

The  basic  difference  between  a  radar  system  and  a  communications  system  is 
that  spontaneous  modulations  and  coding  of  the  signal  carrier  oscillation  by  the 
message  is  impossible  in  the  former.  Therefore,  one  may  only  seek  the  optimum 
with  respect  to  possible  signal  carrier  oscillation  types  and  reception  methods. 

In  the  case  of  passive  radar,  when  an  object's  natural  radiation  is  used  to  determine 
its  parameters,  optimization  is  possible  only  with  respect  to  reception  methods. 

Any  radio  receiving  device,  regardless  of  the  system  for  which  it  is  /13 
intended,  will  comprise  three  component  parts  (Figure  1.4):  antenna- feeder  device 
A,  receiver,  and  output  device  (recipient).  Signal-plus-noise  E(t,  X,  Y,  Z)  at 
antenna- feeder  device  input  is  a  function  not  only  of  time,  but  of  space  coordinates 
(generally  of  three  coordinates  X,  Y,  and  Z),  and  at  this  device's  output,  i.  e., 
at  receiver  input  it  is  a  function  of  time  y(t). 

A  human  or  any  automatic  device  (loudspeaker,  oscillograph,  electronic  computer, 
and  so  forth)  receiving  reproduced  message  x  may  play  the  role  of  output  device 
(recipient).  The  output  device  is  not  examined  directly  in  the  theory  of  optimal 
radio  reception  methods.  It  is  considered  only  during  mathematical  formulation 
of  requirements  levied  on  receiver  output  effect  'X (t)  (the  requirement  is  for 
the  mean  square  deviation  'X  (t)  from  reproduced  message  x(t)  be  minimal,  for 
example) . 


Figure  1.4.  (a)  —  Receiver;  (b)  —  Output  device  (recipient). 

Either  the  antenna- feeder  device  and  receiver  combination  or  just  the  receiver 
is  subjected  to  optimization  during  the  synthesis  process.  In  the  latter  case, 
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one  will  seek,  as  already  noted  above,  the  optimal  receiver  structure  for  given 
signal-plus-noise  y(t)  at  its  input. 


It  is  evident  that  the  best  results  are  obtained  if  the  antenna-feeder  device 
and  receiver  combination,  rather  than  the  receiver,  is  optimized.  This  requires 
having  statistical  characteristics  of  random  space-time  process  E(t,  X,  Y,  Z) 
rather  than  of  the  random  time  process  y(t).  Here,  both  the  mathematical  formulation 
of  the  synthesis  problem  and  its  solution  are  complicated  radically.  /14 

Complication  of  problem  formulation  means  mathematical  description  of  noise 
space-time  characteristics.  This  description  requires,  in  particular,  very  complete 
a  priori  information  on  the  noise,  which  is  lacking  in  many  cases.  However,  in 
view  of  the  great  importance  of  the  overall  receiving  device  optimization  process, 
including  the  antenna  system  as  well,  great  attention  has  been  devoted  in  recent 
years  to  space-time  synthesis  [134]. 

In  future,  where  this  is  not  stipulated,  the  discussion  will  involve  only 
synthesis  of  the  receiver,  i.  e. ,  the  term  receiver  input  will  be  understood  to 
mean  process  y(t)  at  antenna-feeder  device  output. 

It  should  be  noted  that  Figures  1.1— 1.3  depict  a  single-channel  system.  — 
The  receiver  in  a  multichannel  system  my  have  several  inputs  and  (or)  several 
outputs.  Here,  Figure  1.1,  1.2,  and  1.3  will  remain  valid  if  scalar  functions 
y(t),  7  (t),  x(t),  and  x'(t)  are  replaced  by  the  corresponding  vector  functions 


V(0-{Pi(0>~>Vm(0h 

7  (0  *  {Ti  (0*  •••  •  Yi  (Oh 

. xi  (oh  ^1’5^ 

75)-{V(0 v  (Oh 

where  m  and  1  —  number  of  channels  at  receiver  input  and  output,  respectively. 

Missions  that  modem  radioelectronic  systems  accomplish  often  are  so  complex 
that  the  message  reproduction  process  must  be  divided  into  two  stages:  primary 
8ignal-plu8-noise  processing  and  secondary  processing.  The  division  principle 


may  vary.  For  example,  primary  processing  when  information  transmitted  from  an 
earth  satellite  is  received  may  occur  at  various  points  on  the  earth's  surface, 
while  secondary  processing  occurs  at  a  single  computer  coordinating  center,  which 
issues  the  final  result  T'  (t)  of  message  x(t)  reproduction.  Secondary  processing 
in  recent  years  mainly  occurs  by  means  of  discrete  (digital)  equipment  using 
specialized  or  standard  digital  computers. 

If  problem  conditions  require  that  the  message  reproduction  system  comprise 
two  parts  (primary  and  secondary  processors),  then  the  following  variants  for 
formulation  of  the  problem  for  its  synthesis  are  possible: 

—  optimization  of  the  primary  processor  for  a  given  secondary  processor; 

—•  optimization  of  the  secondary  processor  for  a  given  primary  processor;  /15 

—  optimization  of  both  processors  simultaneously. 

It  is  evident  that  highest-quelity  message  reproduction  may  occur  in  the 
third  variant.  However,  in  several  instances,  this  turns  out  to  be  too  complex 
from  a  mathematical  standpoint  and  from  the  point  of  view  of  technical  realization. 
Therefore,  primary  and  secondary  processing  system  synthesis  often  occurs  inde¬ 
pendently,  i.  e.,  in  accordance  with  the  first  and  second  variants. 

There  is  no  difference  from  the  point  of  view  of  principle  among  the 
aforementioned  three  variants  since  all  may  be  reduced  to  the  receiver  synthesis 
problem  depicted  in  Figure  1.1. 

Here,  in  the  first  variant,  the  term  (t)  (or  (t))  should  be  understood 
to  mean  effect  “Jf  (t)  at  primary  processor  output,  i.  e.,  to  consider  the  secondary 
processor  an  output  device. 

In  the  second  variant,  the  term  y(t)  (or  y(t))  should  be  understood  to  mean 
signal-plus-noise  at  primary  processor  (or  processors)  input,  rather  than  at  antenna 
output. 

In  the  third  variant,  the  term  receiver  should  be  understood  to  mean  the 
combination  of  primary  and  secondary  processors. 


In  spite  of  the  principled  (ideological)  commonality  of  the  three  variants, 
there  are  differences  among  them.  Therefore,  in  recent  years  several  special 
books  [179,  180,  and  others]  were  devoted  to  the  theory  of  optimal  secondary 
processing. 

It  follows  from  what  has  been  stated  that  the  contemporary  theory  of  optimal 
radio  reception  methods  makes  it  possible  to  solve  quite  varied  and  complex  problems 
arising  during  processing  of  essentially  any  radioelectronic  and  other  information 
transmission  and  extraction  systems.  During  the  past  15  years  alone,  dozens  of 
books  [2-9,  11-13,  16,  17,  19,  29-32,  125,  139,  175-183,  and  others]  have  been 
devoted  wholly  or  to  a  significant  degree  to  this  theory.  If  one  sums  up  this 
material,  eliminating  repetition,  he  would  end  up  with  more  than  a  dozen  volumes. 
Therefore,  this  book  should  be  considered  only  an  introduction  to  the  modem  theory 
of  optimal  radio  reception  methods. 

Creation  of  the  theory  of  optimum  linear  filters,  in  which  a  linearity  condition 
is  imposed  on  a  receiver  system  (filter)  separating  the  signal  from  noise, 
historically  preceded  the  theory  of  optimal  reception  methods. 

The  foundations  for  the  theory  of  optimum  linear  filters  were  laid  in  the 
seminal  works  of  Academician  A.  N.  Kolmogorov  [34]  and  N.  Wiener  [28]  as  early 
as  1941-1942.  Wiener  determined  the  requirements  levied  on  the  optimum  linear 
filter  transfer  function  stemming  from  the  condition  of  obtaining  the  minimum 
mean  square  error  at  filter  output. 

Another  criterion  began  to  be  employed  after  1943  for  synthesis  of  optimum 
linear  filters,  this  being  the  criterion  of  the  maximum  ratio  of  peak  signal  voltage 
value  to  the  mean  square  noise  voltage  value.  The  beginnings  here  were  made  /16 
by  the  work  of  North  [113],  who  solved  this  problem  for  white  noise. 

t 

The  works  enumerated,  in  spite  of  their  great  value,  touched  upon  optimal 
synthesis  not  of  the  receiver  as  a  whole,  but  only  of  one  of  its  elements— the 
linear  filter.  V.  A.  Kotel'nikov's  work  Teoriva  potentsial * nov  pomekhoustoychivosti 
("Theory  of  Potential  Noise  Immunity"),  published  in  1946  as  a  doctoral  dissertation, 
was  the  first  work  devoted  to  the  problem  of  finding  the  optimal  receiver  and 
investigation  of  its  properties.  This  work  was  published  without  major  changes 
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in  1956  as  a  monograph  [1],  In  Kotel ' nikov 1 s  work,  almost  all  the  basic  problems 
in  the  contemporary  theory  of  optimal  reception  methods  were  presented  and  solved 
to  a  certain  degree  for  the  first  time.  This  work  retains  its  fundamental  character 
even  today . 

Kotel' nikov  solved  the  problem  for  additive  noise  in  the  form  of  stationary 
gaussian  noise  (a  stationary  fluctuating  oscillation  with  a  normal  law  of 
distribution)  and  for  the  simplest  optimizations  (minimal  mean  square  error  and 
maximum  message  reproduction  a  posteriori  probability  criteria).  In  addition, 
he  made  several  additional  restrictions  and  assumptions  during  his  analysis. 

Some  of  his  restrictions  and  assumptions  successfully  were  removed  in  subsequent 
works.  Cases  of  more  complex  noise  were  examined,  particularly  nonadditive  noise 
leading  to  onset  of  parasitic  random  parameters  in  the  received  oscillation  (i.  e., 
of  those  parameters  unknown  at  the  point  of  reception  not  containing  ary  information 
concerning  the  useful  message  received).  Here,  one  primarily  should  note  the 
work  of  F.  Woodward  [2  and  18],  D.  Middleton  [16,  31,  118],  and  others.  D.  Middleton 
and  others  analyzed  receiver  optimality  for  generalized  statistical  criteria. 

Mathematical  methods  of  statistics — methods  of  testing  statistical  hypotheses 
and  obtaining  statistical  estimates — began  to  be  used  widely  after  the  early  1950's 
to  find  optimal  receivers  and  to  evaluate  their  properties. 

The  foundations  of  the  contemporary  theory  of  statistical  estimates,  i.  e., 
estimation  of  distribution  parameters  of  random  values  and  random  functions,  had 
been  laid  as  early  as  1920-1930  by  R.  Fisher,  and  then  developed  in  the  works 
of  J.  Neyman,  E.  Pearson,  and  other  mathematicians.  Creation  in  1930-1940  of 
the  theory  of  testing  statistical  hypotheses  is  the  work  of  Neyman  and  Pearson. 

These  methods  were  developed  significantly  during  World  War  II  in  works  by  A. 

Wald  [29,  30],  who  developed  the  method  of  series  analysis,  the  theory  of  statistical 
solutions,  and  game  theory. 

D.  Middleton  and  several  other  authors  in  their  works  published  after  1950 
demonstrated  that  all  these  approaches  may  be  used  with  great  success  in  solution 
of  problems  of  methods  of  optimal  signal  reception  in  noise. 


In  accordance  with  the  eforewentioned  sequence  in  the  development  of  the 
theory  of  opt leal  reception  methods  and  for  ease  in  understanding,  the  material  /17 
in  this  book  is  presented  in  the  following  manner. 

Preliminary  information  required  for  construction  of  the  theory  of  optimal 
signal  reception  methods  (general  nature  of  information,  signals,  and  noise, 
optimum  linear  filter  properties,  and  so  forth)  is  presented  in  Part  I. 

A  brief  exposition  of  A.  N.  Kotel'nikov's  theory  of  potential  noise  immunity 
is  provided  in  Part  II. 

In  Part  III,  the  theory  moves  into  i$e  more  general  case  of  signals  with 
parasitic  unknown  parameters.  Here,  as  was  the  case  in  Part  II,  the  theory  will 
be  constructed  only  for  the  simplest  receiver  optimizations — the  criteria  of  maximum 
inverse  probability  and minimal  mean  square  error. 

Part  IV  is  devoted  to  the  search  for  optimal  receivers  with  the  aid  of  con¬ 
temporary  methods  of  mathematical  statistics,  which  make  it  possible  to  solve 
the  problem  for  breed  classes  of  receiver  optimization.  It  also  is  possible 
successfully  to  demonstrate  with  the  aid  of  these  approaches  that  the  results 
obtained  during  analysis  of  optimal  receivers  intended  for  highly-reliable  mid 
-precise  message  reproduction  essentially  are  identical  for  a  broad  class  of 
optimizations,  including  the  simplest  criteria  accepted  by  Kotel’nikov. 

Therefore,  in  spite  of  the  fact  that  the  theory  postulated  in  Parte  II  and 
III  of  the  book  are  based  on  the  simplest  criteria,-  its  results  turn  out  to  be 
valid  for  e  broad  class  of  optimizations  if  the  requirements  for  meeaage  reproduction 
precision  and  (or)  reliability  are  sufficiently  high.  However,  problems  which 
never  arise  in  the  simpler  formulation  of  the  teaks  need  in  Parte  II  and  ill  also 
ere  examined  |n  Part  IV.  This  category  includes,  in  particular,  impacts  of  the 
optimizations  and  typa  of  a  priori  diatrlbiftions  on  optimal  receiver  structure 
and  properties,  methods  of  receiver  synthesis  given  incomplete  a  priori  data, 
the  special  features  with  respect  to  several  quality  indicators,  sequential  signal 
fluictioni  ana  ocners* 


1.2  General  Nature  of  Messages,  Signals,  and  Noise 

In  general,  useful  signal  u  (t)  is  a  function  of  time  modulated  by  message  x. 
There  are  three  characteristic  cases,  depending  on  message  x  type: 

1.  Reception  of  discrete  messages. 

2.  Reception  of  individual  values  of  continuous  messages. 

3.  Reception  of  oscillations  (filtration). 

The  first  case  occurs  if  the  message  may  have  only  discrete  values  x^,  x^, 

.  .  .,  x  .  Here,  the  set  of  possible  messages  must  be  complete,  i.  e.,  include 
all  possible  message  values.  If  problem  conditions  permit  pauses,  i.  e.,  cases 
when  there  are  no  messages,  then  they  must  be  considered  cases  in  which  zero  /18 
message  x  is  transmitted.  All  messages  are  considered  to  be  unknown  magnitudes 
at  the  place  of  reception  (if  you  assume  that  certain  messages  are  known  beforehand 
at  the  place  of  reception,  then  they  are  not  considered  messsages  because  they 
contain  no  information;  reception  of  such  "messages"  in  principle  is  innecessary) . 

In  the  majority  of  cases,  these  unknown  magnitudes  during  reception  may  be 
considered  random  magnitudes  described  by  certain  laws  of  distribution  of  a  random 
variable. 

In  the  simplest  case  of  message  reception,  when  its  signal  u^(t)  corresponds 
to  each  message  value  x^,  while  the  results  of  reception  of  preceding  messages 
are  not  considered  during  reproduction  of  message  x^  in  the  receiver,  the  set 
of  random  messages  (xQ,  x^,  X2»  .  .  «,  xm)  is  characterized  fully  by  the  collection 
of  unidimenaional  probabilities  of  these  messages  P(xQ),  P(x^),  .  •  P(xm). 

More  complex  reception  cases  may  require  consideration  also  of  composite  probabilities 
P(x0,  Xj),  P(xlf  x2),  and  so  on.  All  these  distributions  of  a  random  message 
variable  are  referred  to  as  a  priori  distributions  since  they  are  known  (or  assumed 
to  be  known)  prior  to  the  experiment,  i.  e.,  prior  to  investigation  of  signal- 
plus-noise  y(t)  arriving  at  receiver  input.  In  future,  the  entire  collection 
of  a  priori  prob*ilitiea  of  message  set  (xQ,  xv  .  .  .,  xm)  will  be  designated 

P(x). 
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Distribution  P(x)  for  a  case  where  only  unidimensional  probabilities  P(xk) 
are  considered  is  depicted  in  Figure  1.5.  Since  the  message  set  always  is  complete, 
then 

(1.6a) 

The  second  case  occurs  when  the  received  message  is  a  continuous  magnitude 
having  fixed  value  x  during  each  experiment,  i.  e.,  during  each  observation  cycle 
(T). 


Actual  messages  usually  are  continuous  functions  of  time  x(t).  However, 
if  function  x(t)  changes  slightly  during  observation  cycle  (T)  (Figure  1.6),  then 
one  may  consider  that  reception  of  individual  values  of  continuous  messages  takes 
place.  Here,  message  x  may  be  looked  upon  during  reception  as  an  unknown  magnitude, 


Figure  1.7 


which  may  have  any  value  within  certain  known  limits —  from  *m  to  xmtM  . 

In  the  majority  of  cases,  this  unknown  magnitude  during  reception  may  be  /19 
considered  a  random  magnitude  having  a  certain  a  priori  distribution.  As  usual, 
we  will  denote  this  distribution  P(x)  to  obtain  a  series  of  standard  messages 
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valid  for  any  typoc  of  messages  x,  remembering  however  that  there  is  a  probability 
density  rather  than  a  probability  in  the  case  of  continuous  messages  P(x).* 

In  a  general  case  of  reception,  this  distribution  must  include  unidimensional 
and  multidimensional  probability  densities.  In  the  simplest  cases,  when  the  results 
of  only  one  experiment  (one  observation  cycle  T)  are  considered  as  message  x  is 
being  reproduced,  it  suffices  to  know  unidimensional  distribution  P(x)  similar 
to  that  depicted  in  Figure  1.7. 

In  this  case 


J  P(x)dx~  I. 


(1.6b) 


The  third  case  occurs  if  message  x(t)  changes  over  time  so  rapidly  that  it 
must  be  considered  a  time  function  even  within  the  limits  of  one  observation  cycle 
(T).  In  this  event,  message  x(t)  reproduction  during  reception  must  be  considered 
an  unknown  time  function.  In  a  majority  of  instances,  this  unknown  function  may 
be  looked  upon  as  a  random  function  having  a  certain  a  priori  distribution,  which 
we  again  will  designate  P(x).  But,  P(x)  now  must  be  a  multidimensional  probability 
density  characterizing  the  probability  of  certain  realizations  of  funct.'-.n  *(t)  ** 


Thus,  in  all  cases  desired  message  x  at  the  place  of  reception  is  an  unknown 
magnitude  or  unknown  time  function.  In  a  majority  of  cases,  it  is  possible  to 
consider  this  unknown  magnitude  (unknown  function)  as  a  random  magnitude  (random 
time  function)  characterized  by  some  a  priori  distribution  P(x).  /20 


P(x)  is  a  combination  of  probabilities  if  message  x  is  a  discrete  random 
magnitude.  It  is  a  probability  density  if  message  x  is  a  continuous  random  magnitude. 


*The  possibility  of  obtaining  a  series  of  standard  expressions  for  messages 
that  are  discrete  random  magnitudes,  continuous  random  magnitudes,  and  continuous 
random  time  functions  is  stipulated  by  the  fact  that  certain  important  relationships 
valid  for  probabilities  (the  law  of  probability  multiplication,  for  example)  remain 
valid  for  probability  densities  as  well  (unidimensional  and  multidimensional). 

**That  is,  in  this  case  P(x)dx  is  shorthand  for  the  following  expression: 

P(xr  x 2 f  *  *  *9  xn)dxi»  ^2*  *  *  *  i  dxQ , 
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Distribution  P(x)  always  is  multidimensional  (strictly  speaking,  infinitely- 
dimensional)  during  reception  of  oscillations.  Distribution  P(x)  in  the  simplest 
case  of  reception  may  be  unidimensional  during  reception  of  the  individual  values 
of  continuous  messages. 

Now  we  will  shift  to  the  characteristics  of  the  signals  carrying  message  x. 
These  signals  may  be  divided  into  signals  precisely  known  and  signals  with  unknown 
parameters. 

A  signal  is  referred  to  as  precisely  known  if  message  x  is  the  only  unknown 
parameter  in  oscillation  u^(t),  i*  e.,  oscillation  ux(t)  is  known  precisely,  given 
known  x. 


If,  besides  desired  message  x,  a  signal  comprises  some  other  unknown 
parameters—  Ct  C£  2,  and  so  forth,  not  carrying  any  information  concerning 
x,  then  it  is  referred  to  as  a  signal  with  unknown  parameters  and  it  is  designated 
ux,CC1,CX2(t)  or  u(x»OC1*CC2;  t)f  respectively. 


For  example,  let  message  x  be  transmitted  with  the  aid  of  amplitude  modulation 

A#  6*  f 


«*(0 -«•(>+*)«»  («*+?)•  (1>7) 

Such  a  signal,  besides  message  x,  is  determined  by  three  other  parameters:  uQ, 
67,  and  0.  Therefore,  signal  (1.7)  is  considered  precisely  known  if  uq,  qj  , 
and  0  are  known  beforehand  at  the  place  of  reception  (i.  e. ,  prior  to  reception 
of  combination  y(t)). 

If  phase  0 ,  along  with  x,  also  is  known  at  the  place  of  reception,  then 
instead  of  (1.7)  one  must  write: 

"«*0  -f  *)«»(«>/  + <p).  (1.8) 


Signal  parameters  x,  CX  CX  ,  and  so  on  during  observation  cycle  (T)  may 


be  both  constant  magnitudes  and  time  functions.  In  many  cases,  unknown  parasitic* 
parameters  CX^, Of  2 ,  and  so  forth  may  be  considered  (or  actually  are)  random 
magnitudes  or  random  time  functions.  In  these  cases,  signal  uCxjCtpO^,  *  *  •» 
CX m;  t)  is  referred  to  as  a  signal  with  random  parameters. 


Noise  distorting  a  signal  is  divided  into  additive  and  nonadditive  (modulating) 
noise.  Additive  noise,  as  noted  in  §  1.1,  is  the  term  used  for  noise  «m(0  , 
which  is  a  component  of  combination  y(t): 

y(t)  =  «*(<)  + «u,(0- 

All  other  noise  is  referred  to  as  nonadditive  noise. 

Receiver  internal  noise  is  the  typical  and  most  important  of  the  additive  /21 
noises. 

Nonadditive  noise,  for  example,  includes  noise  arising  in  the  process  of 
radio  wave  propagation  and  reflection  and  causing  signal  parasitic  modulation 
with  respect  to  amplitude  and  phase.  Nonadditive  noise  acts  upon  one  or  several 
signal  parameters,  i.  e. ,  causes  signal  parasitic  modulation,  so  it  may  also  be 
referred  to  as  modulating  noise. 

Since  nonadditive  noise  manifests  itself  in  the  change  of  certain  parameters 

CX  CC  .  .  CX  a  signal  distorted  by  such  noise  may  be  considered  a  signal 
l  t  m 

with  random  parameters  CX  ^,CX  2»  •  •  .»CXm.  Therefore,  signal-plus-noise  y(t) 
may  be  written  in  the  form 

y(t)-u  (x.a,,  a,, ... ,  ctm;  t)  +  um  (t)  C 1 • 

when  nonadditive  noise  is  and  is  not  present. 

Parameters  CX, ,CC  .  .  . ,Ct  may  be  simply  unknown  when  nonadditive  noise 
1  i  m 

is  absent  rather  than  being  mandatory  random  magnitudes.  One  or  several  of  these 


*As  noted  in  §  1.1,  those  signal  parameters  not  carrying  any  information  about 
message  x  that  are  unknown  at  the  place  of  reception  are  called  parasitic  parameters. 
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parameters  usually  are  random  magnitudes  or  even  random  time  functions  when  such 
noise  is  present. 


As  a  rule,  additive  noise  «m(/)  is  a  random  time  function.  Consequently, 
signal-plus-noise  y(t)  also  must  be  considered  a  random  time  function. 

Random  time  functions  f(t),  as  is  known,  are  characterized  by  multidimensional 
distributions  of  the  type  P( f^ ,  f2»  •  •  •»  fj<»  •  •  *»fn)  where  f^,  f2,  .  .  ., 
fk,  .  .  .,  fn  are  values  of  function  f(t)  at  individual  moments  in  time  (t^,  t2, 

.  .  .,  t^,  .  .  .,  tn)  of  observation  cycle  (T),  while  magnitude  n  must  be,  strictly 
speaking,  infinitely  large  (see  §  1.3  on  this  point). 

The  random  process  is  referred  to  as  stationary  (in  the  narrow  sense  of  the 
word)  if  distributions  P(f^,  f2,  •  .  . ,  f  )  do  not  change  when  the  time  reference 
changes  (for  any  values  of  n).  Otherwise,  the  process  is  not  stationary. 

Additive  noise  in  many  cases  may  be  considered  a  stationary  random  process. 

In  particular,  the  most  important  type  of  such  noise— receiver  internal  noise, 
usually  may  be  considered  stationary.  The  majority  of  actual  stationary  processes 
have  the  property  of  ergodicity,  i.  e.,  averaging  of  process  f(t)  with  respect 
to  time  given  the  same  results  as  does  statistical  averaging  for  the  given  moment 
in  time  with  respect  to  all  possible  realizations  of  the  process. 

We  will  designate  averaging  conditionally  by  means  of  the  following  symbols 
since  we  will  in  future  be  called  upon  often  to  use  various  types  of  averaging: 

a)  averaging  with  respect  to  time  for  infinitely-great  time  —  dashed  line 
over  the  averaged  magnitude: 


(1.10a) 


b)  averaging  with  respect  to  time  for  finite  time  T  —  enclosing  the  averaged 
magnitude  in  parentheses  with  subscript  T 


c)  statistical  averaging  (finding  the  expected  value)  —  solid  line  above 
the  average  magnitude  or  the  sign  M  (expected  value)  in  front  of  this  magnitude 

1  —  Mf. 

If  f  —  continuous  random  magnitude,  then 

7 -Mf-  [  fP(l)df,  (1.10c) 

At 

where  —  region  of  all  possible  values  of  magnitude  f. 

For  ergo die  processes 

f(/j*»7-  (l.lOd) 

Such  terminology  as  "white  noise,"  "nonwhite  noise,"  and  "gaussian  noise" 
are  used  in  subsequent  discussions  to  designate  several  of  the  most-important 
types  of  random  noise.  The  term  white  noise  is  understood  to  mean  a  stationary 
fluctuating  oscillation  having  an  equidimensional  frequency  spectrum  in  the  entire 
frequency  range  (from  0  to  CO  ). 

If  the  noise  spectrum  is  equidimensional  in  a  restricted  frequency  range 
and  equals  zero  outside  this  range,  then  such  a  fluctuating  oscillation  is  referred 
to  as  noise  with  an  equidimensional  restricted  spectrum.  Noise  with  a  non- 
equidimensional  spectrum  is  referred  to  as  non-white  noise. 

Since  the  white  noise  correlation  function  has  the  form  of  a  delta  function, 
such  noise  is  referred  to  as  uncorrelated  noise.  As  opposed  to  this,  non-white 
noise,  including  that  with  an  equidimensional  spectrum  in  a  restricted  band,  is 
correlated  noise. 

Noise  having  a  gaussian  (normal)  law  of  distribution  (here,  there  may  be 
any  spectrum)  is  referred  to  as  gaussian  (normal)  noise. 

In  accordance  with  this  terminology,  normal  white  noise  is  referred  to  as 


a  stationary  random  process  having  a  normal  law  of  distribution  and  an  squid imensional 
frequency  spectrum. 

It  should  be  noted  that  white  noise,  after  passage  through  any  real  linear 
network,  is  converted  at  network  output  into  a  process  with  a  normal  law  of  dis¬ 
tribution.  This  provides  the  basis  to  assume  that  the  law  of  white  noise  distribution 
is  normal. 

1.3  Expansion  of  Time  Functions  Into  Series  With  Respect  to  Orthogonal  Functions 

Expansion  of  time  functions  being  investigated  into  series  with  respect  to 
some  known  ("standard")  orthogonal  time  functions  is  widely  used  in  the  theory  /23 
of  optimal  reception  methods  for  mathematical  description  of  signals  and  noise. 

Use  of  such  expansions  in  many  instances  makes  it  possible  significantly  to  simplify 
analysis  and  to  make  it  clearer,  especially  if  one  is  dealing  with  random  time 
functions. 

The  most  widespread  is  the  Kotel'nikov  expansion  [33],  which  he  obtained 
for  time  function  f(t)  having  a  spectrum  restricted  to  frequencies  from  0  to 
fB.  .  This  expansion  has  the  form 


(1.11a) 


(1.11b) 


(1.11c) 


In  addition,  it  follows  from  (1.11a)  and  (1.11c)  that 
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Figure  1.8 


Coefficienta  of  aariea  (1.11a)  are  values  of  function  f(t)  taken  after 


discrete  time  intervals  At  (Figure  1.8). 


If  function  f(t)  with  a  restricted  spectrum  is  considered  only  during  finite 
time  interval  T  (Figure  1.8),  precise  expansion  (1.11)  is  replaced  by  the  /24 
following  approximate  expansion: 


/(0~£  /**,(/). 


(1.12! 


where  fnwm f(*&t);  A/  —  -L;  fnT\ 

*r  ■ 

ft  ft AO 


(1.121 


r  0  where  l^kk; 

*  Vt  where  l  —A. 


(1.12 


Comparison  of  formulas  (1*11)  and  (1.12)  demonstrates  that  approximate  rela¬ 
tionships  (1.12)  are  obtained  from  precise  simple  truncation  of  infinite  sums, 
i.  e.,  by  restricting  them  by  those  values  f^  of  function  f(t)  which  fall  within 
the  limits  of  time  T  (Figure  1.8).  As  will  be  demonstrated  below,  the  expansion 
(1.12)  error  is  slight  if  n^l. 

If  function  f(t)  has  relatively-narrow  spectrum  E(f)  restricted  by  frequencies 
f^  and  fg  (Figure  1.9),  then  it  often  is  convenient  to  represent  it  in  the  form 


Figure  1.9 


of  an  oscillation  modulated  with  respect  to  amplitude  and  with  respect  to  phase: 

/(O-r(/)co$|2^,/+0(/)),  U-13) 

where  }%  —  —  —  average  frequency  of  the  spectrum*,  while  r(t)  and  6 (t) 

—  amplitude  (envelope)  and  initial  phase  of  oscillation  f(t),  respectively. 

Here,  along  with  approximate  expansion  (1.12),  it  also  is  possible  to  use 
the  following  approximate  expansion  for  time  interval  (T)  [17,  page  214]: 

*It  is  possible  to  introduce  f  by  another  method  as  well. 


(1.14) 


f  <0 -2  ¥*'  (/)  rk  cos  (2a/,  / + 0h), 

A*  I 


where  r,  »  r  (4  A/);  0,  »  0  (k  A/); 


T_ 

St 


'  Af  •  T; 


V(0 


slnwA/(/—  t  &<) 
n  A/  (/—  *  St) 
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Since  similar  to  <(t),  are  standard  orthogonal  functions,  then 

function  f(t)  completely  is  determined  by  values  rk  and  6 k  of  its  amplitude  and 
phase  taken  after  time  interval  At  a  1/Af.  The  total  number  of  these  values  is 
2n^  a  2AfT  (n^  values  of  amplitude  and  n^  values  of  phase),  i.  e.,  just  as  many 
as  was  the  case  for  expansion  (1.12),  which  uses  instantaneous  values  of  function 
f(t). 


At  the  foundation  of  all  these  expansions  is  the  assumption  that  the  spectrum 
of  the  expanded  function  is  restricted  by  band  Af,  i.  e.,  does  not  have  components 
outside  this  band.  This  assumption  naturally  is  an  idealization  since  the  spectrum 
of  the  oscillations  at  the  output  of  actual  physical  systems  always  has  several, 
albeit  slight,  "tails”  rather  than  a  complete  cutoff  outside  any  finite  band. 


It  is  possible  to  demonstrate  (see  [52]  and  [74],  for  example)  that  the  assump¬ 
tion  concerning  the  finitude  of  the  spectrum  width  especially  is  undesirable  from 
the  point  of  view  of  principle  when  approximating  random  time  functions  since, 
strictly  speaking,  any  random  function  must  have  an  infinitely -wide  spectrum. 
Therefore,  to  expand  random  time  function  f(t),  it  is  possible  to  select  interval 
width  At  between  neighboring  values  f(kAt)  of  this  function,  stemming  not  from 
the  width  of  the  function  f(t)  spectrum,  which  should  be  considered  infinitely 
large,  but  from  its  correlation  interval  *tQl  i.  e.,  to  assume 
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Here,  expansion  into  a  series  will  be  valid  only  in  the  event  that  this  interval 
is  much  less  than  observation  cycle  T,  i.  e. ,  only  where 


T,  A<  * 


(1.16) 


In  other  words,  it  would  be  more  natural  when  expanding  random  time  functions 

to  begin  not  with  the  assumption  concerning  the  finitude  of  the  width  of  their 

spectrum,  but  from  the  assumption  concerning  inequality  with  zero  of  their  correlation 

interval  X  . 

o 

However,  expansion  methods  based  on  the  assumption  concerning  the  spectrum 
constraint  are  used  more  widely  both  for  determinate  and  for  random  time  functions. 

A  large  number  of  results  important  for  practice  and  proven  have  been  obtained 
with  their  help.  Therefore,  in  future,  we  will  use  these  very  expansions  to  present 
the  material. 

We  will  examine  the  simplest  of  them  in  more  detail,  specifically  Kotel'nikov 
expansions  (1.11)  and  (1.12). 


As  already  noted,  expansion  (1.11)  is  precise  if  function  f(t)  has  a  spectrum 
restricted  by  band  /„  .  This  formula  provides  a  relative  error,  whose  mean  square 
has  a  magnitude  on  the  order  of  AE/E  [66],  for  real  functions  with  an  unrestricted 
band.  Here,  E  —  complete  energy  of  the  spectrum  (unrestricted  with  respect  /26 
to  width),  while  AE  —  energy  of  the  spectrum  "tail",  i.  e. ,  that  part  of  it  located 
outside  band  /.  .  Consequently,  the  expansion  (1.11)  error  is  infinitesimally 
small  if  only  a  relatively -small  amount  of  the  studied  function's  spectrum  energy 
is  concentrated  outside  band  /*  . 

Expansion  is  used  in  the  theory  of  optimal  reception  methods  mainly  for  oscil¬ 
lation  y(t)  arriving  at  receiver  input.  Here,  usually  it  is  possible  to  consider 
function  y(t)  spectrum  width  /,  as  large  as  desired,  and  formula  (1.11)  here  may 
be  considered  very  precise.  In  addition,  in  a  majority  of  instances,  use  of 
expansions  (1.11)  and  (1.12)  is  only  an  intermediate  mathematical  drill— during 
the  subsequent  computations  following  expansion,  an  opposite  transition  occurs 
from  series  (sums)  to  convolute  expressions  (integrals),  in  which  spectrum  width 
is  not  included,  and  therefore  can  be  a  large  as  desired. 
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Thus,  in  a  majority  of  cases,  use  of  expansion  (1.11)  will  not  introduce 
any  errors,  major  or  otherwise,  in  spite  of  the  fact  that  real  functions  have 
a  theoretically-unrestricted  spectrum.  Expansion  (1.12),  as  opposed  to  (1.11), 
is  approximate,  even  if  function  f(t)  has  a  restricted  spectrum. 

Actually,  formula  (1.12)  was  obtained  from  precise  (given  a  restricted  band) 
formula  (1.11)  by  means  of  simple  truncation,  i.  e.,  by  discarding  those  parts 
of  function  f(t)  which  are  outside  interval  (T)  (Figure  1.8).  Therefore,  expansion 
(1.12)  will  not  provide  an  error  compared  to  (1.11)  only  when  entire  function 
f(t)  actually  is  concentrated  in  interval  (T).  But  the  function,  concentrated 
in  a  finite  time  interval,  in  principle  may  not  have  a  restricted  spectrum.  Thus, 
expression  (1.12)  in  principle  may  not  be  precise  if  one  simultaneously  assumes 
that  both  spectrum  width  f%  and  observation  cycle  T  duration  are  finite,  i.  e., 
if  the  number  2faT  of  discrete  values  of  this  function  are  finite. 

For  the  formula  (1.12)  error  to  be  slight,  this  condition  must  be  met 

«=2/,r>l.  (1.17) 

This  inequality  essentially  may  be  satisfied  by  increasing  both  T  and  f ,  . 

Formula  (1.12)  coincides  with  (1.11)  when  T -*■  oo  (and  with  finite  /■  ) 
and  only  the  aforementioned  shortcomings  linked  with  the  assumption  of  the  finitude 
of  spectrum  width  remain. 

If  f9-*-  oo  ,  then,  as  is  easy  to  demonstrate,  expansion  (1.12)  will  become 
precise  even  for  a  finite  T.  Actually,  it  follows  from  expressions  (1.12a)  and 
(1.12b)  that,  at  discrete  moments  kAt,  equality 

MO-SfafcW 

I 

is  satisfied  precisely  sines  /27 


tin 2n/, </-»&/)  ^  ||  where 
2 «/■(*— *&0  (o  where  t—l&t, 

Consequently,  the  inexactitude  of  series  (1.12a)  may  manifest  itself  only 
within  each  interval  At,  and  not  at  its  ends.  But,  when  /»— ■  <>o  ,  each  interval 

At  will  strive  towards  zero  (Figure  1.8),  the  edges  of  this  interval  merge,  and, 


35 


consequently,  series  (1.12a)  will  become  precise  not  only  for  discrete,  but  also 
for  all  moments  in  time  in  the  ranges  examined  from  0  to  T. 

Thus,  when  /*-*■<»,  expansion  (1.12)  is  made  precise  even  during  finite 
observation  cycle  T. 

As  already  noted  above,  in  the  theory  of  optimal  reception  methods,  expansion 
of  function  f(t)  into  series  (1.12)  usually  is  only  that  intermediate  mathematical 
operation  in  which  the  spectrum  /»  width  is  not  included  and,  therefore,  it  may 
be  considered  as  large  as  desired.  Here,  use  of  expansion  (1.12)  provides  no 
errors  of  any  kind.  Its  use  is  convenient  because  function  f(t)  completely  is 
determined  by  finite  number  n  of  its  discrete  values  f^,  usually  referred  to  as 
sample  values.  These  values  f^  also  may  be  considered  the  coordinates  of  function 
f(t)  in  n-dimensional  space  (see  §  6.3). 

Use  of  expansion  (1.12)  is  especially  useful  if  f(t)  is  a  random  time  function. 
Actually,  in  accordance  with  this  expansion,  function  f(t)  completely  is  determined 


Figure  1.10 


by  its  n  ordinates  f^,  f2»  .  .  .,  fn  taken  at  discrete  moments  in  time  with  some 
interval  =»  I/2/s  (Figure  1.10).  Therefore,  probability  dP,  such  that  realization 
of  f^(t)  of  random  time  function  f(t)  will  be  fixed  within  the  boundaries  of  an 
infinitely-thin  layer  restricted  in  Figure  1.10  by  dotted  and  solid  lines,  equals 
the  composite  probability,  such  that  ordinates  f^,  f^,  .  .  .,  fn  will  be  located 
accordingly  within  the  limits 


dP=PUi,ft . fn)dfxdft...dfn. 


where  P(f^,  f£,  •  •  •*  fR)  --  n-dimensional  probability  density. 

Consequently,  the  random  time  function  completely  is  characterized  by 
n-dimensional  probability  density  P(f^,  f£»  .  .  fn)  given  for  the  entire  range 
of  possible  values  of  ordinates  f^,  fj,  .  .  .,  fn« 

We  will  find  as  our  example  the  n-dimensional  distribution  for  normal  white 
noise,  i.  e. ,  for  stationary  random  time  function  um(/),  having  a  normal  distribution 
and  ojuidimensional  frequency  spectrum.  It  should  be  assumed  that  the  spectrum  /28 
of  this  noise  is  limited  by  the  band  (0  -r /■)  in  order  for  expansion  (1.12)  to 
be  applicable. 

The  following  relationships  are  valid  for  such  noise: 


«m(0  =  «®“0; 

(1.19a) 

(1.19b) 

S„(I)-N,-£  where  ,_0+f>.  | 

| 

(1.19c) 

Sn,  (/)  =  0  outside  these  limits;  ‘ 

,  m  '  ’W”’ 

(1.19d) 

where  Sm{f)  —  noise  energy  spectrum;  N  —  noise  voltage  mean  square  equalling 
its  dispersion  (since  «m  =0);  parameter  N  may  be  considered  just  like  the  noise 
power  density,  i.  e.,  power  developed  by  voltage  um(t)  across  a  1  ohm  resistance; 
Nq  —  noise  power  density  arriving  per  unit  of  band;  —  unidimensional 

law  of  noise  distribution. 


We  will  find  the  noise  normalized  correlation  function  p(r)  where 


p  (T)  _  (0  «b  « -bt) 

'  '  Jn* TO  (1.20) 

Function  p(x)  is  coupled  with  energy  spectrum  Sm([)  by  the  following  known 
relationship  [20]: 


P  (T)  “  ===  $  {f)  cos  2nfrdf. 

“m*  (0  o 

Considering  relationship  (1.19c),  we  obtain 


p(*)=  Ntcos2nfxdf 

n 


tin  2 nfi  t 
2n/,t 


(1.21) 


(1.22) 


A  curve  with  respect  to  this  formula  is  plotted  in  Figure  1.11,  from  which 

it  is  evident  that  there  is  no  cross-correlation  of  voltage  values  um( /)  separated 

by  interval  1/2/,  .  Hence  it  follows  that,  when  noise  voltage  is  represented  as 

series  (1.12>,  ordinates  f, ,  .  ,  . ,  f  included  in  this  series  should  be  considered 

i  n 

uncorrelated.  In  addition,  as  follows  from  (l.l9d),  these  ordinates  are  subordinate 
to  the  normal  law  of  distribution.  But,  it  is  known  from  the  theory  of  probabilities 
that,  given  a  normal  law  of  distribution,  the  absence  of  correlation  signifies 
also  an  absence  of  statistical  conjunction,  i.  e.,  statistical  independence  [20], 


Therefore,  in  a  case  of  normal  white  noise  with  a  limited  band,  ordinate 
values  fp  fj,  .  •  .»  fn  are  statistically  independent  magnitudes  and  n-dimensional 
probability  density  P(f^,  .  .  .,  fR)  equals  the  product  of  the  equidimensional 
probability  densities: 


P  (f.,  /, . /„)  -  P  (ft)  P  (ft)  P  (U  (1.23) 

Therefore,  in  accordance  with  (1.19d)  and  (1.23),  it  is  possible  to  write 

W  m  0*m)  "  P  (f,mi  •  •  •  ■  •  ,,mn)  “ 


where  n  =  2  f,  T. 


Considering  (1.12d)f  it  is  possible  to  represent  expression  (1.24)  also  in 
the  following  forms 


"•  <“■>-  7iW“P  ■  (1-25) 

It  follows  from  expressions  (1.24)  or  (1.25),  in  particular,  that  the  greatest 
probability  density  corresponds  to  zero  realization  of  noise  voltage  um(/)  , 
i.  e.,  to  that  realization  which  equals  zero  in  entire  observation  cycle  T. 


39 


CHAPTER  TWO 


/29 


OPTIMUM  LINEAR  FILTERS 
2.1  Introductory  Notes 

The  problem  of  optimum  linear  filters  is  illuminated  in  detail  in  several 
domestic  and  translated  books  [3,  8,  10-12,  136,  and  others].  Therefore,  we  will 
dwell  briefly  only  on  two  of  the  most -characteristic  variants  of  all  those  ways 
of  formulating  the  problem  of  optimum  linear  filters — a  filter  creating  the  maximum 
signal-to-noise  ratio  at  its  own  output  and  a  filter  providing  minimal  mean  square 
error  in  signal  reproduction. 

As  will  be  demonstrated  in  §  2.5,  the  basic  field  of  use  for  the  first  type 
of  filter  (i.  e.,  those  creating  the  maximum  signal-to-noise  ratio)  is  filtration 
of  modulated  signals,  i.  e.,  filtration  preceding  signal  detection  (demodulation). 
Here,  the  job  of  the  optimum  filter  is  to  provide  best  separation,  not  of  /3Q 
entire  modulated  signal  ux(t)  on  the  noise  background,  but  only  useful  message  x 
carried  on  it,  which  is  a  parameter  of  this  signal. 

It  will  be  demonstrated  in  §  2.5  that  the  first  type  of  optimum  filter  in 
many  cases  is  a  component  part  of  the  optimum  receiver. 

It  is  advisible  to  use  filters  of  the  second  type,  as  shown  in  §  2.5,  in 


those  instances  when  entire  signal  uc(t),  rather  than  one  (or  several)  of  its 
parameters,  is  the  useful  message  subject  to  separation  from  the  noise.  Therefore, 
such  filters  may  be  used,  for  example,  in  receiver  stages  connected  beyond  the 
detector  (demodulator),  as  well  as  in  various  automated  and  telemechanical  devices. 


2.2  Linear  Filters  Providing  Minimum  Mean  Square  Error 

I 

Classic  formulation  of  this  problem,  posed  by  N.  Wiener  as  early  as  1941  [28], 


y(t) 


JhiHeuNvu 

Qujitmp 


rfti 


Figure  2.1.  (a)  —  Line  filter. 


involves  the  following  (Figure  2.1).  The  sum  of  signal  and  noise  arrives  at 
stationary  linear  system  (filter)  input: 


I 


y(t)  *UC(0  +  »□.(')• 


(2.1) 


Both  noise  and  signal  are  considered  stationary  random  processes  with  zero 
mean  values  and  known  correlation  functions  Rc(  T  )  and  J?m(T)  .  If  oscillations 
uc(t)  and  um(i)  are  cross-correlated,  then  the  function  of  their  cross  correlation 
/?cm( j)  also  is  assumed  to  be  known  and  does  not  depend  on  the  time  reference. 

{  Since  a  filter  is  a  stationary  linear  system  according  to  the  condition, 

then  it  is  determined  completely  by  its  transfer  function  /((/u>)  or  pulse  unit 
%  step  function  *1(0  .  As  is  known,  K(/“)  and  n(0  are  linked  by  a  Fourier  transform: 


I 


$  K(/®)e'«'d«o, 

2  n  -m 


(2.2) 


K(/*)'-  $  ri  (0  e-**‘  dt. 


(2.3) 
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For  a  physically-realized  filter,  this  condition  must  be  satisfied*  /31 


where  ^<0. 


(2.4) 


and  relationship  (2.3)  takes  the  form 


K  (/<•>)=$  rj  (/)  e-/«<  dt. 


(2.3a) 


The  task  of  the  linear  filter,  along  with  removing  noise  from  the  signal, 
also  may  include  some  sort  of  linear  conversion,  such  as  amplification,  differt  i- 
tiation,  integration,  and  so  on.  Therefore,  generally  speaking,  certain  time 
function  h(t)  coupled  with  the  signal  by  a  given  linear  conversion  must  be  obtained 
at  filter  output  when  noise  is  absent.  For  instance: 

h (/) M t  if  differentiation  is  required; 
dt 
t 

/»(/)—  | ue  (t)dt,  if  integration  is  required; 

h(t)  A/),  if  a  time  shift  by  At  is  required; 

h  (/)  — aue  (/),  if  amplification  by  a  factor  of  a  is  required; 

A  (/)  *•  ue  (/),  if  simple  signal  reproduction  is  required.  Consequently,  in  the  general 
case,  there  is  a  requirement  to  obtain 


(2.5) 


where  ^(p)  —  linear  transformation;  p  =  d/dt. 

Oscillation  (t)  at  linear  system  output  is  linked  with  oscillation  y(t) 
at  input  by  known  relationship 


y(t)-  I  y(f— ■*)•  iW** 


(2.6) 


♦In  addition,  function  *1(0  must  (at  the  corresponding  rate)  strive  towards 
zero  when  t-*  »  ;  but,  in  a  majority  of  instances,  condition  (2.4)  is  limiting. 
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In  the  case  of  a  physically-realized  filter,  this  relationship,  in  accordance 
with  (2.4),  takes  the  form 


v(0“  §  y{t— t)n(t)dt. 

o 


(2.6a) 


Therefore,  when  both  signal  and  noise  are  present  at  input,  at  output  there 
is 


?{/)=$  [«,(/— t)+iib(/— t)Jn(T)dT.  (2.7) 

o 

The  required  oscillation  at  filter  output  is  h(t).  Therefore,  error  /32 
8(0.  obtained  at  filter  output  equals 


•  W-v(0— *(/). 

The  mean  square  of  this  error  equals 


where  formula  (2.7)  determines 


(2.8) 


(2.9) 


The  optimal  filter  is  one  with  minimum  magnitude  e*  .  Consequently,  mathe¬ 
matically  the  problem  boils  down  to  finding  that  type  of  pulse  transient  character 
istic  tj(0  included  in  formula  (2.7)  in  which  magnitude  (2.9)  is  minimum.  Wiener 
solved  this  problem  using  calculus  of  variables  approaches;  it  was  found  that 
desired  characteristic  q(Q  of  the  optimim  linear  filter  must  be  solution  of  the 
following  integral  equation  (see  [3],  for  example): 


|  R,  (T  0  ’I  (0  dt  *■  Rth  (t)  where  t  >  0, 


(2.10) 


where  R,{x)  =  y(t)y{t+x) 
and  *»»(*)  -vfiW+i) 


(2.11) 


are  correlated  functions  assumed  to  be  known. 


If  the  signal  and  noise  statistically  are  independent,  then 


where 


#,h(T)“«e(OM/+T).l 
^,(T)=Re(T)  +  ^B,  (t),| 

5 

^eW"«0  (0««(/  +  T) 

K0.  (*)=■««  U)Um(t  +  i) 

If,  in  addition,  only  signal  amplification  is  required,  i.  e., 


(2.12) 


Rlh  (t)  -  aRc  <t). 


(2.13) 


Solution  of  equation  (2.10)  leads  to  the  following  expression  for  the  optimum 
linear  filter  transfer  function  K(j&)  [12]s 


m  m 

K  (/»)  -  - —  [  e-i*  dt  $  J*  (M)  e/-«  do). 

2**  (/•)  $  .i,  ♦(-/«) 


(2.14) 


where 


1 9  (/»)  |  ’  —  5,  («>). 


(2.15) 
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Here,  S¥h  M  and  SfM  —  power-density  spectra  corresponding  to  correlation 
functions  Rth  (T)  and  Rt  (t)  ,  i.  e. , 


S„M“  7 


(2.16) 


Since  correlation  functions  Rvh(r)  and  Ru(?)  are  assumed  as  known  during  solution 
of  the  problem,  then  power  spectra  and  St( a)  also  are  known.  Therefore, 

formula  (2.14)  computations  require  only  determination  of  complex  magnitude  W/’®). 
and  modulus  |  $(/«•) |,  which  preliminarily  will  be  found  from  relationship  (2.15). 


The  identical  link  to  that  between  line  system  transfer  function  and 
its  modulus  |K(/w)  I  exists  between  functions  Mi}*)  and  ♦(—/**)  .  Therefore,  the 

same  methodology  may  be  used  for  computations  of  Mi*)  with  respect  to  the  magnitude 
I  Ml°>) I  found.  This  methodology  is  presented  in  §  18.3. 


The  minimum  mean  square  error  determined  by  the  following  formula  corresponds 
to  the  transfer  function  determined  by  relationship  (2.14) 


(•)—!*  (/«>)  I  *  S,  MI  da. 


(2.17) 


where 


Sh  M  -  $ 


is  the  function  h(t)  power-density  spectrum. 

Practical  computations  with  formula  (2.14)  turn  out  to  be  rather  enormous, 
while  the  formula  itself  is  slightly  unclear.  A  more  convenient  formula  replacing 
(2.14)  is  presented  in  §  18.3.  However,  the  computations  here  also  remain  enormous 
(see  §  18.3).  They  are  simplified  significantly  if  the  filter  is  not  tasked  with 
physical  realization,  (2.4),  i.  e.,  to  assume  that  the  tower  limit  in  formula  (2.3a) 
and  expressions  stemming  from  it  equals  -40  rather  than  zero. 


Here,  as  Bode  and  Shannon  demonstrated  for  the  first  time,  optimum  filter 
transfer  function  /C(j®>)  is  determined  by  the  following  simple  relationship  [12]: 


the  possibility  of  selecting  the  optimum  filter  characteristic  and,  as  a  result, 
may  only  worsen  rather  than  improve  the  final  result. 


The  optimum  linear  filter  determined  by  relationship  (2.10)  or  (2.14)  is 
referred  to  as  a  Wiener  filter  and  corresponds  to  a  case  where  signal  u  (t),  noise 
um(f)  ,  and  synthesized  filter  are  stationary.  In  future,  the  task  of  optimum 
linear  filtration  was  generalized  for  the  case  of  a  transitory  signal,  noise, 
and  filter.  Here,  as  was  the  case  with  Wiener,  it  was  assumed  that  signal  and 
noise  have  zero  mean  values  and  known  correlation  functions. 


In  this  more  general  case,  optimum  linear  filter  impulse  response  ii(/,  t)  is 
determined  by  integral  equation 

* 

j'Rt(r,s)r\(t,s)ds^Ryh(t,T),  (2.21) 

If  impulse  response  *1(1.  *)  satisfies  this  equation,  then  the  mean  (with  respect 
to  realizations)  square  error,  e*(/)  ,  turns  out  at  each  moment  in  time  to  be  minimal 
and  equals 


(2.22) 


In  formula  (2.21),  functions  Rt(r,s)  and  R9h(t,  t)  ,  respectively,  are  /35 
correlation  functions  RfVu  *•)  of  input  combination  y(t)  and  cross-correlation 
functions  /?**(/(,  /*)  between  y(t)  and  required  output  effect  h(t)  from  which  the 
corresponding  precise  definition  of  the  arguments  is  derived  (for  example,  /?„*(/,  ?) 
is  obtained  from  RfhWu  **)  ,  with  t  replacing  t^  and  %  replacing  t2). 

Since  processes  y(t)  and  h(t)  in  this  case  may  be  transitory,  then,  as  opposed 
to  equation  (2.10),  the  correlation  functions  will  depend  on  t  and  ,  and  not 
only  on  .  In  addition,  since  the  synthesized  system  may  turn  out  to  be  transitory, 
its  impulse  response  equals  n(/t  t)  ,  rather  than  v(0  or  ~ T) .  The  ranges 
of  integration  in  equation  (2.21)  are  changed  compared  to  (2.10)  since,  in  this 
case,  it  is  proposed  that  observation  begin  at  moment  tQ  and,  consequently,  considers 
a  transitory  process  caused  by  connection  at  moment  tQ  of  the  system  itself  or 
onset  of  input  mixture  y(t)  at  that  moment,  which  is  the  same  thing. 


In  the  Wiener  case  examined  previously,  all  processes  in  the  system  are 
stationary,  i.  e. ,  it  is  assumed  that  tQ  =  -00  for  finite  value  t  (or  it  is 
assumed  that  t  strives  towards  infinity  when  t  =  0,  which  is  the  same  thing). 
i  It  is  not  difficult  to  be  convinced  that  equation  (2.10)  may  be  obtained  from 
(2.21)  as  a  partial  case  occurring  when  tQ  =  -  OQ  , 


I 


—  Ryh(tt  tt), 

*■(*!  —  * *)  11  ,l('*T)=,l(/—  T>- 

Consequently,  equation  (2.21)  may  be  considered  as  a  generalized  (for  a  stationary 
case)  Wiener  formula. 

If  noise  Um(0  has  the  form  of  white  noise  and  there  is  simple  message  repro- 
diction,  then 


U, 


and 


RjH  {tit  /,)  33  Rr  (tl<  tj)t 


where  5in0  —  noise  spectral  density  (bilateral). 


Here,  as  it  is  easy  to  see,  equation  (2.21)  acquires  the  following  form: 


J'  R „  (t,  s)  rj  (t,  s)  ds  +  Sm9  n  (/,  t)  =  Rr  (/.  t).  <  t  <  /. 

u  . 


(2.21a) 


In  the  general  case,  there  is  no  success  in  finding  the  precise  analytical 
solution  for  integral  equation  (2.21).  At  the  present  time,  it  is  obtained  only 
for  a  case  where  u  (t)  and  um(t)  are  stationary  random  processes  (see  [127], 

C 

for  instance).  There  is  success  in  obtaining  a  precise  solution  of  equation  (2.21) 
for  transitory  u  (t)  and  um(f)  only  for  several  special  types  of  correlated 


functions  R„lt,  t)  and  /?»*(/,  i).  Thus,  the  method  Shinbrot  presented  in  [194]  makes 


V»  r*  v  •»  V  *  V  %'  V  *-•  V  • 


•  m  ■*  H  •  W  -  *. 


A- 
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it  possible  to  obtain  a  precise  solution  if  the  correlation 
functions  can  be  presented  in  the  following  form: 

<? 

/?„(/,  t)=  J1  a9(t)b9(  t)  with  T^’ 

('. *) -  7  a, (t) b,  (/)  With  *>* 

♦=i 

and 

<? 

*„*(/.  T)« 

Q 

in  which  regard  2^  (o4  (/)fr,,{t)—  =  IT(/— t),  i.e.,  it  depends 

only  on  the  difference  (t  -  t). 

It  should  be  noted  that  the  theory  of  optimum  linear  filtra¬ 
tion  continues  to  develop  continously.  In  recent  years  a  large 
contribution  to  the  development  of  this  theory  was  made  by 
r.  Kolman.  In  the  Kolman  formulation  the  characteristics  of  the 
signal  u  (t)  [or  the  totality  of  signals  are  not  given 

C  G 

directly.  Only  the  differential  equation  which  it  satisfies  is 
known . 

Such  a  more  general  and  complex  formulation  of  the  filtration 
problem  proves  to  be  especially  useful  in  the  case  of  accomplishing 
optimum  linear  filtration  in  complex  radio  control  spectrum.  Here 
the  structure  of  the  optimum  filter  also  proves  to  be  complex  and 
often  can  be  practically  realized  only  on  the  basis  of  digital 
electronic  computers.  A  brief  presentation  of  R.  Kolman's  theory 
is  given,  for  example,  in  the  book  by  R.  Lee  [153] • 

2.3*  Linear  Filters  Which  Ensure  Maximum  Signal/Noise  Ratio. 

Let  the  sum  of  signal  and  noise: 

y  (/)  ™  Me  (/)  +  um  (0» 

operate  on  the  input  of  a  linear  filter  (Fig.  2.1)  where  is 

white  noise  described  by  relatlonshipis  (1.19a,  b,  c,  d),  and 

is  a  signal  with  known  amplitude  spectrum  ,  so  that 


Hr  (0  *  j-J-  $  S(/o>)e*'dM. 


(2.23) 


Since  the  filter  is  a  linear  system,  oscillation  If  (t)  at  filter  output  equals: 


y  (t)  =  ue  (0  +  um  lu,(0>  (2.24) 

where  components  uetilK(t)  and  unmMx(0  are  caused  by  the  action  of  signal  u£(t) 
and  noise  ua(t)  ,  respectively. 

If  K(Jto)  — .  filter  transfer  function,  then  /37 

ue  §  a:  (/<!>)  S  (/«•)  rf.rt,  (2.25) 

*0 

t/n«  =  «’~(7j-£  $|K(/«d)|’</<d.  (2.26) 

Let  t  be  some  fixed  moment  in  time  in  which  voltage  “c»Hx(f)  attains  value 
“ciwx^o)  .  Then,  in  accordance  with  (2.25) 


°e  bmx  (f«)  $  K  (/<•*)  S  (/’(d)  e/“'»  dm. 


(2.27) 


Ue  will  designate 


r  =  Ue .  (2.28) 

Um 

It  is  evident  that  r  is  the  ratio  of  signal  output  voltage  (at  moment  tQ) 
to  noise  output  voltage  mean  square  value  or,  briefly  stated,  the  ratio  of  signal 
(at  moment  tQ)  to  noise  at  filter  output.  The  filter  in  which  this  ratio  is  the 
greatest  is  considered  optimum. 

From  (2.25)— (2.28)  it  follows  that 


-L  \  K  (/•)  S  (/») 

2n_r_ 


therefore,  mathematically  the  problem  boils  down  to  finding  that  form  of  filter 
transfer  function  K(jio)  in  which  expression  (2.29)  attains  the  highest  value. 
North  solved  this  problem  in  1943  using  calculus  of  variables  approaches  [113]. 
In  1946,  Van  Vleck  and  0.  Middleton  independently  solved  the  same  problem  with 
the  aid  of  the  Bunyakovskiy-Shvarts  inequality  (this  method  is  presented  in  [8], 
for  instance). 

The  result  of  this  solution  is  that  r  =  rn»*e.  if 


K  (/©)  ™aS*  (/to) 


(2.30) 


where  a  —  arbitrary  constant  coefficient,  while 

S*  (/©)»S(— /©) 


(2.31) 


is  a  magnitude,  complex  conjugate  to  signal  spectrum  5(/<»)  or,  briefly  stated, 
a  complex  conjugate  signal  spectrum. 


(2.32) 


\  1 5  (/«)  |  *  *»  J  hc*  (/)*=(?. 


(2.33) 


where  Q  —  signal  energy  at  filter  input. 


Therefore,  (2.32)  also  may  be  written  in  the  following  form: 


(2.34) 


It  follows  from  this  expression  thst  magnitude  rm»ne  ,  equalling  ratio 
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[-(7^— 1m«c  »  will  not  depend  on  t  if  filter  transfer  function  K(ja>) 
is  selected  for  each  new  t  value  in  accordance  with  (2.30). 

In  other  words,  we  may  insure  maximum  signal- to-noise  ratio  determined  by 

formula  (2.34)  for  any  predetermined  moment  in  time  tQ.  All  this  requires  is 

that  filter  transfer  function  K(j*»)  be  selected  from  formula  (2.30),  in  which  t 

’  o 

is  included  in  the  form  of  factor  accomplishing  an  output  voltage  shift 

over  time  by  magnitude  tQ.  This  result  is  fully  understandable  since,  due  to 
the  noise  being  stationary,  a  shift  over  time  in  noise  voltage  does  not  change 
its  mean  square  Um*  ,  while  a  shift  over  time  of  signal  voltage  displaces  without 
distortions  signal  output  voltage  over  time  by  the  required  magnitude. 

It  also  follows  from  this  that  voltage  “c»%ix(0  attains  its  maximum  (peak) 
value  at  moment  t  since,  otherwise,  it  would  be  possible  during  a  certain  additional 
shift  over  time  to  obtain  a  greater  signal-to-noise  ratio  value  than  follows  from 
(2.34),  which  is  impossible. 

Thus,  the  optimum  filter  provides  the  maximum  peak  signal  voltage  value  ratio 
to  the  noise  voltage  mean  square  value. 

It  follows  from  (2.30)  that 


I KKM I  *=  a  |  S  (/o>)  | ,  (2.35) 

i.  e.,  the  optimum  filter  frequency  characteristic  coincides  (precise  to  a  constant 
factor)  with  the  signal  amplitude  spectrum.  Therefore,  this  optimum  filter  often 
is  referred  to  as  a  matched  (with  the  signal)  filter. 

In  order  to  solve  the  problem  of  the  possibility  of  physical  realization 
of  a  (2.30)-type  filter  transfer  function,  we  will  find  impulse  transient  char¬ 
acteristic  *1(0  corresponding  to  it. 

Substituting  (2.30)  and  (2.31)  in  (2.2),  we  obtain  /39 


(2.36) 


On  the  other  hand,  it  follows  from  (2.23)  that 

«C  <*•-/)-—  S  S  (/«.»)  eW'.-OrfM. 

Comparing  this  expression  with  (2.36),  we  obtain  the  following  expression 
for  the  optimum  filter  impulse  transient  characteristic: 

0-  (2.37) 

Since  the  physical  realization  condition  has  the  form 

q(/)=0  where  /<0, 

then  meeting  this  condition  requires  that 

u„  (/,  — /)  =  0  where  /  <  0. 

i.  e., 

ue  (/)  =o  where  t>t„  (2.38) 

Consequently,  the  optimum  filter  is  physically  realizable  only  when  condition 
(2.38)  is  met.  This  condition  signifies  that  signal  u£(t)  voltage  at  filter  input 
must  disappear  prior  to  moment  tQ  when  signal  voltage  at  filter  output  attains 
its  maximum.  It  is  possible  to  meet  this  condition  for  many,  but  not  for  all, 
signal  u  (t)  types. 

C 

Actually,  let  the  signal  at  filter  input  damp  with  respect  to  exponent 
£/r**  ,  for  example.  This  signal  disappears  only  when  t — »oq  ;  thus, 

realization  of  condition  (2.38)  requires  that  voltage  at  filter  output  attain 
its  maximum  later  than  at  infinity,  which  evidently  is  impossible.  Consequently, 
i  i  optimum  filter  is  physically  unrealizable  for  the  signal  depicted  in  Figure 
2.2a. 

However,  in  many  actual  cases,  the  signal  at  filter  input  damps  sufficiently 


Figure  2.3 


We  note  the  following  in  conclusion.  It  was  assumed  during  derivation  of  formula 
(2.34)  that  extant  noise  value  Um  at  filter  output  is  determined  in  the  steady-state 
mode,  i.  e.,  when  noise  appears  at  input  at  moment  fn  ,  where  Vs*  — «°  .  However, 
it  was  demonstrated  in  [17M  that  noise  dispersion  at  matched  filter  output  attains 
its  steady-state  value  as  early  as  time  T  following  appearance  of  noise  at  its 
output  (here  T  —  signal  uc(t)  duration  with  which  the  given  filter  is  matched). 
Therefore,  if  a  matched  filter  is  connected  at  the  moment  of  signal  appearance 
at  output  rather  than  at  moment  /,  =  00  f  formula  (2.34)  remains  valid. 

We  will  examine  several  examples. 

Example  1  [8].  A  signal  has  the  form  depicted  in  Figure  2.3a,  to  wit: 

/  where  <<<,, 

RC 

««(<)=• 


o 


where  t  > 


Figure  2.4  Figure  2.5.  (a)  —  IYa  [integrating 

cell]:  (b)  —  RK  [differential  stage] 
(c)  —  LZ  [delay  line]. 


Example  2.  A  signal  comprises  a  single  square  pulse  with  duration  tQ  disappearing 
at  moment  t^  s  T  (Figure  2.4). 

Corresponding  to  this  signal  is  complex  spectrum 


S(/«>-  -i- (i -«-'«•) 


(2.42) 
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and,  in  accordance  with  (2.30)  and  (2.31),  the  optimum  filter  must  have  transfer 
function 


(2.43) 


In  accordance  with  (2.40),  we  take  tQ  s  ^  -  To;  then  (2.43)  takes  the  form 


-^(1 -*-**•).  (2.44) 

Such  a  transfer  function  is  realized  in  the  system  depicted  in  Figure  2.5. 
Integrating  cell  IYa,  having  transfer  constant  a/i*  ,  forms  the  first  factor  in 
formula  (2.44).  Delay  line  LZ,  creating  a  delay  by  time  T  in  combination  with 
differential  stages  RK,  has  transfer  constant  (t  —  corresponding  to  the 

second  factor  in  expression  (2.44). 

Optimum  filter  impulse  transient  characteristic  *l  (0  is  determined  by  formula 
(2.37)  and  has  the  form  depicted  in  Figure  2.6a  for  the  signal  being  examined 
(Figure  2.4). 
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Figure  2.6 


Ouring  practical  determination  of  function  KO  type,  instead  of  (2.37),  /42 
it  is  often  more  convenient  to  use  the  following  relationship  stemming  from  it 
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i.  e.y  Impulse  transient  characteristic 


(£-«).  (2-45> 
1(0  is  (precise  to  constant  factor  a) 


a  mirror  image  of  function  u  (t)  relative  to  point  t  /2. 

c  o 


In  the  specific  example  examined  (Figure  2.4),  when  tQ  =  ^ Q  and  function 

u  (t)  is  symmetrical  relative  to  point  t  /2,  the  mirror  image  coincides  with 
c  o 

original  and  function  t(0  has  (precise  to  a  constant  factor)  a  form  identical 

to  that  of  signal  u  (t)  voltage  (Figure  2.6a). 

c 


Filter  output  voltage  "«•»»*( 0  equals 


“e  mis  (0™  J  «« ((— t)  t|  (t)  dx. 


(2.46) 


Considering  that  functions  uc(t)  and  1  (0  have  the  form  depicted  in  Figure 
2.4  and  2.6a,  respectively,  from  (2.46)  we  will  find  that  ««  mod)  has  the  form 
depicted  in  Figure  2.6b,  i.  e. ,  the  optimum  filter  converts  a  signal  rectangular 
pulse  into  a  triangular  pulse  with  double  duration. 

Example  3.  The  signal  is  a  radio  pulse  with  a  rectangular  envelope  (Figure 
2.7).  The  pulse  has  duration  and  frequency  f  of  high-frequency  occupation. 
It  is  possible  to  solve  this  problem,  with  both  a  precise  approach  by  direct 


Figure  2.7 


computation  of  function  *</•>  using  formula  (2.30)  and  approximately  by  the 
method  of  slowly-changing  amplitudes. 

One  may  demonstrate  [71]  through  slowly-changing  amplitudes  that,  if  *(/») 
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is  the  optimum  transfer  function  for  a  video  pulse,  then  KI/(»>  —  w*)l will  be  the 
optimum  transfer  function  for  a  radio  pulse  with  the  same  envelope.  Here,  a  filter 
output  voltage  envelope  optimum  for  a  radio  pulse  coincides  with  that  optimum 
for  a  video  pulse. 

The  resultant  error  here  is  less,  the  greater  the  ratio  7^/T  ,  the  greater 
the  high-frequency  periods  T  the  radio  pulses  comprises.  These  assumptions  are 
valid  for  a  random-envelope  radio  pulse.  The  only  requirement  is  for  this  envelope 
to  exist,  i.  e. ,  that  the  amplitude  of  the  sinusoidal  oscillation  change  slowly 
enough  from  one  high-frequency  period  to  another. 

We  will  use  the  slowly-changing  amplitudes  approach  for  the  Figure  2.7  pulse. 

In  light  of  all  this,  the  optimal  transfer  function  for  this  pulse  may  be 
obtained  from  expression  (2.44)  for  a  rectangular  video  pulse  by  replacing  w 
with  (»—«»)  : 

(2.47) 

For  simplicity,  we  will  assume  that  the  radio  pulse  comprises  a  whole  /43 
number  of  periods,  i.  e.,  T  / Tq  s  m,  where  m  =  whole  number. 

Then,  (2.47)  takes  the  form 

(2.47a) 

It  is  possible  to  obtain  a  transfer  function  approximating  ,//<w  —  *'•)  by  using 
a  highly-selective  cavity  resonator,  while  term  e",w,,  is  obtained  using  a  delay 
line  with  time  delay  T  . 

However,  as  will  be  shown  in  2.4,  there  is  no  need  for  a  single  radio  pulse 


when  building  such  an  optimum  filter,  since  a  much  simpler  quasi-optimum  filter 
provides  very  similar  results. 


In  light  of  this,  an  envelope  at  filter  output  optimum  for  a  radio  pulse 
coincides  with  filter  output  voltage  optimum  for  a  video  pulse  if  the  radio  pulse 
at  input  has  the  same  envelope  as  the  video  pulse.  Thus,  given  the  Figure  2.7 
pulse,  the  pulse  envelope  at  filter  output  has  the  same  form  as  in  Figure  2.6b, 
while  the  pulse  itself  has  the  form  shown  in  Figure  2.8. 

Consequently,  an  optimum  filter  converts  a  radio  pulse  with  a  rectangular 
envelope  into  one  with  a  triangular  envelope  and  double  duration. 


Figure  2.9 

Example  4.  A  signal  comprises  train  n  of  coherent  radio  pulses  with  spacing 
Tm  ,  8  rectangular  envelope,  and  duration  (Figure  2.9). 

We  assume  that 


A.m.  Is., 
r,  ’  r,  • 


where  TQ  —  high-frequency  occupation;  m  and  1  —  whole  numbers. 


Computations  provide  this  expression  for  the  optimum  filter  complex  /44 
transfer  constant: 


*</•)- *!</«•)  A,  </«•). 
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(2.48) 


where  Kt(i»)  —  filter  complex  transfer  constant  determined  from  formula  (2.47a) 
optimum  for  a  single  radio  pulse; 

*,(/«)- 1 +t~,m**+t~,maTn+  ...  +  (2.4 

As  usual,  we  assumed  that  tQ  =  t,  when  deriving  formulas  (2.48)  and  (2.49). 


It  follows  from  Figure  2.9  that,  in  this  case 


4-4-<*-i)r»+T* 


The  Figure  2.10  optimum  filter  schematic  diagram  corresponds  to  formulas 
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Figure  2.10.  (a)  —  LZ  [delay  line]. 


(2.48)  and  (2.49)  and  depicts  filter  Kid*),  optimum  for  a  single  radio  pulse 
(aee  Example  3)  and  a  delay  line  set. 


It  is  evident  that  this  delay  line  set  may  be  replaced  by  one  delay  line 


Figure  2.11.  (a) 
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[isolating  stage]. 
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with  n~ 1  taps  and  isolating  stages  RK  (Figure  2.11). 

A  large  number  of  pulses  n  requires  too  many  lines  and  taps.  However ,  when 
n^l,  expression  (2.49)  may  be  simplified  as  follows. 

We  will  designate 


Then,  K* (/•»)«■! +*+**+  •••  Where  n— *©o,  we  obtain 


(2.50) 


This  transfer  constant  may  be  obtained  in  the  Figure  2.12  circuit  and  comprises 


’W 


Bn'**7*  i 
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Figure  2.12 


broadband  amplifier  with  amplification  Kq  enveloped  by  negative  feedback  across 


a  delay  line. 


This  circuit's  transfer  constant  equals 


*p(«- 


(2.51) 


Where 


A,p-l 


(2.52) 
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transfer  constant  Kp  (/»)  coincides  with  required  function  (in  principle, 

precision  up  to  a  constant  factor  does  not  play  a  role). 


Figure  2.13 


Optimum  filter  output  voltage  «en»(0  has  the  form  depicted  in  Figure  2.13 
(where  it  is  assumed  for  simplicity  that  n  =  3). 


It  follows  from  (2.34)  that  signal-to-noise  ratio 
is  proportional  to  signal  energy  Q: 


r*M»i«c  at  filter  output 

(2.53) 


For  the  signal  shown  in  Fig.  2.9  Q-«Q,. 

where  —  energy  of  one  pulse,  while  Q  —  energy  of  the  entire  signal  (pulse 
"packet")  comprising  n  periodic  pulses. 


Consequently, 


(2.54) 


Ue  now  will  explain  what  frequency  characteristic  the  Figure  2.10  optimum 
filter  and  its  links  Ki(fm)  and  *,(/«)  have. 


It  follows  from  (2.35)  that  optimum  filter  frequency  characteristic 
I  *(/*H  coincides  with  respect  to  form  with  signal  frequency  spectrum  I5(/«)I  . 

Therefore,  frequency  characteristic  |Af(/w)|  for  a  single  radio  pulse  with  a 
rectangular  envelope  has  the  form  depicted  in  Figure  2.14a,  while  that  for  /46 
an  infinite  periodic  train  of  such  pulses  has  the  form  depicted  in  Figure  2.14b. 
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Figure  2.14  Figure  2.15 


Heref  the  envelope  of  the  Figure  2.14b  characteristic  (dotted  curve)  coincides 
with  the  characteristic  for  a  single  pulse. 

It  follows  from  (2.48)  that 

(2.55) 

where  l,*i(/w)|  —  optimum  filter  frequency  characteristic  where  n  :  1,  i.  e. ,  the 
Figure  2.14a  characteristic. 

It  follows  from  formula  (2.55)  and  Figure  2.14a,  b  that  frequency  characteristic 
| «■«(/») |  where  n— ►  OOmust  have  the  form  depicted  in  Figure  2.14c.  Actually, 
if  you  remultiply  the  ordinates  of  the  curves  in  Figure  2.14a  and  b,  then  charac¬ 
teristic  |  K (/•)  | ,  depicted  in  Figure  2.14b  will  be  obtained.  Consequently,  the 
frequency  characteristic  of  optimum  filter  unit  A,  (/<*)  (Figure  2.10)  had,  where 
n — e»eo,  the  form  of  a  comb  with  infinitely-narrow  teeth.  One  also  may  come 
to  that  conclusion  through  direct  analysis  of  expression  (2.49). 


If  the  number  of  packet  pulses  n  is  great,  but  finite,  then  signal  frequency 
spectrum  5  (/«)  ceases  being  linear,  while  frequency  characteristic  I  Kt  (/«)  i  already 
will  comprise  teeth  of  finite  length  (Figure  2.15b)  rather  than  infinitely-narrow 
teeth  (Figure  2.14c).  The  smaller  the  number  of  pulses  n,  the  greater  the  width 
of  the  comb  teeth  and,  where  n  =  1,  characteristic  I  (/<•>)  i  is  converted  into  a 
horizontal  straight  line  (Figure  2.15a). 


From  this  example,  it  is  possible  to  draw  the  following  conclusions  about 
the  structure  of  a  filter  optimum  for  a  "packet"  (train)  of  n  periodic  pulses 
(Figure  2.9): 

1.  The  filter  will  comprise  two  units  —  Ki  (/<*>)  and  /f,  (/«)  . 

2.  Unit  (/#»)  is  a  filter  optimum  for  a  single  pulse.  Therefore,  this 
unit's  frequency  characteristic  |/f,  (/u)|  has  the  form  of  a  continuous  curve  (Figure 
2.14a)and  will  not  depend  on  the  number  of  pulses  n. 

3.  Unit  /C,  (/«)  structure  and  frequency  characteristic  will  depend  considerably 
on  the  number  of  pulses  n.  Where  n  1,  this  unit  has  a  comb-shaped  frequency 
characteristic  with  narrow  teeth  (Figure  2.15b)  and  therefore  is  referred  /47 

to  as  a  comb  filter.  The  greater  the  n,  the  narrower  the  comb  teeth. 

4.  A  comb  filter  may  be  realized  using  various  methods  (see  2.10 — 2.12). 


In  the  event  a  circuit  with  feedback  is  used  (Figure  2.12),  feecfcack  magnitude 
jO  Kq  must  be  selected  from  the  following  relationships: 

a)  where  n  a  oo  f  it  must  be J3kq  =  1  (see  formula  (2.51)).  However,  this 
case  is  unrealistic  since  n  =  Oo  cannot  occur  and  it  is  impossible  to  obtain  stable 
operation  of  the  Figure  2.12  circuit  where^  Kq  =  1; 

b)  Where  n  s  1,/?KQ  must  equal  0  (since  K '» (/■»)=  I  must  be  the  case  here); 

c)  The  greater  the  n,  the  narrower  the  teeth  must  be  of  curve  I  K»  (/<•')  I 
and,  accordingly,  the  closer  magnitude /^K  is  to  unity. 


However,  it  should  be  kept  in  mind  that,  where  n  i  Oo  ,  the  Figure  2.12  filter 
with  feedback  does  not  correspond  fully  to  an  optimum  comb  filter  (since  expression 
(2.50)  precisely  coincides  with  (2.49)  only  where  n — ^°°) . 


Example  5.  The  signal  will  comprise  a  rectangular  video  pulse  train  with 
spacing  Tn  (Figure  2.16).  In  this  event,  it  is  not  difficult  to  obtain  the  following 


Figure  2.16 


results,  by  means  of  both  direct  computation  from  formula  (2.30)  and  the  method 
of  slowly-changing  amplitudes. 

An  optimum  filter,  as  was  the  case  in  the  preceding  example,  will  comprise 
two  units  K ,(/*»)  and  Kt  (/“)  (Figure  2.10),  where  Kt  (/«>>)  —  filter  optimum 

for  a  single  video  pulse  (see  formula  (2.44)  and  Figure  2.5),  while  K,(/w)  -- 
comb  filter  described  by  formula  (2.49). 

Consequently,  unit  Kt  (/«»)  has  a  structure  identical  to  that  for  the  video 
pulse  train  examined  in  Example  4  and  may  be  realized,  in  principle,  by  the  identical 
means  (Figures  2-10 — 2.12).  However,  comb  filter  realization  in  the  case  of  radio 
pulses  is  significantly  more  complex  than  is  the  case  for  video  pulses  since, 
in  the  former  case,  much  higher  requirements  are  levied  for  precise  accomplishment 
and  time  delay  T„  stability. 

If  you  denote  tolerable  time  delay  instability  (or  imprecise  accomplishment) 
by  Ar„  ,  then  for  radio  pulses,  there  is  the  requirement  to  meet  the  condition 

AT g  'K.  T%  =»  ~j~  > 


then,  as  was  the  case  for  video  pulses,  it  is  sufficient  to  meet  the  condition 


A7*n  <  T». 


Actually,  it  follows  from  formula  (2.49)  that  an  optimum  filter  unit  must 


sum  the  pulses  arriving  at  its  input  with  time  delays  T„,  2 rn. ....  (n  —  t)  r„.  in 
order  that  the  peak  signal  voltage  obtained  during  summing  is  maximum,  for  radio 
pulses  it  is  necessary  that  the  time  delay  error  be  slight  compared  with  pulse 
high-frequency  occupation  period  Tq.  This  error  must  be  as  great  as  possible 
in  the  case  of  video  pulses. 

It  also  follows  from  formula  (2.49)  that,  in  essence,  pulse  accumulation 
be  the  basic  operation  during  optimum  periodic  pulse  train  filtration.  Such 
accumulation  may  be  accomplished  not  only  by  means  of  delay  lines  (or  line),  but 
also  by  other  known  methods,  such  as  with  the  aid  of  storage  tubes. 

2.4  Quasi-Optimum  Linear  Filters 

Those  linear  filters,  the  shape  of  whose  frequency  characteristic  is  preassigned 
and  maximum  signal-to-noise  ratio  is  provided  only  by  appropriate  frequency  char¬ 
acteristic  bandwidth  selection,  are  referred  to  as  quasi-optimum  filters.  In 
other  words,  a  quasi-optimum  filter  is  one  in  which  only  the  bandwidth,  rather 
than  the  frequency  characteristic  shape,  is  optimum. 

The  problem  of  finding  such  a  filter  was  solved  for  the  first  time  by  V.  I. 
Siforov  [35].  Siforov  examined  a  signal  in  the  form  of  a  single  radio  pulse  with 
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Figure  2.17 


Figure  2.18 


Figure  2.19 


a  rectangular  envelope  (Figure  2.7)  and  assumed  that  frequency  characteristic 
|  /((/on) |  has  a  rectangular  shape  with  bandwidth  A<>>  —  2n/7  (Figure  2.17).  He  demonstrated 


that  maximum  signal-to-noise  ratio  r  at  the  output  of  such  a  filter  (formula  2.28) 
is  obtained  where  the  filter  bandwidth  value  is 


(2.56) 


Here 


,* 

r  HIM 


0,82 


2Q 

N . 


(2.57) 


It  follows  from  comparison  of  relationships  (2.34)  and  (2.57)  that,  in  the 

case  of  a  quasi-optimum  filter,  signal  energy  Q  greater  by  a  factor  of  1.22  is 

required  to  obtain  such  a  value  rMaHe  •  Consequently,  in  this  case  a  quasi-optimum 

filter  provides  a  relatively  small  loss  in  required  signal  energy— less  than  1  dB. 

Also  important  is  the  fact  that  the  filter  bandwidth  P  value  turns  out  to  be 

relatively  slightly  critical — when  bandwidth  P  deviates  by  not  more  than  a  factor 

of  1.5  from  its  optimal  value  ^onr  (towards  an  increase  or  decrease),  magnitude 
2 

r  decreases  by  not  more  than  a  factor  of  1.25,  i.  e.,  the  loss  in  required  signal 
energy  does  not  exceed  1  dB. 


The  frequency  characteristic  of  actual  filters  has  a  smooth,  rather  than 
a  rectangular,  nature  (Figure  2.18)  and  often,  with  respect  to  shape,  approximates 
a  gaussian  curve  of  the  form 


l*<Z2Li  w (2.58) 

Here,  for  a  radio  pulse  with  a  rectangular  envelope,  a  quasi-optimum  filter 
provides  even  better  results  than  in  the  case  of  a  rectangular  frequency  charac¬ 
teristic. 

Actually,  in  the  case  of  a  radio  pulse  with  a  rectangular  envelope,  optimum 
frequency  characteristic  shape  takes  the  form  depicted  in  Figure  2.14a.  It  is 
evident  from  comparing  Figures  2.14a,  2.17,  and  2.18  that,  in  shape,  an  actual 


frequency  characteristic  is  closer  to  optimum  than  is  a  rectangular  frequency 
characteristic. 


An  actual  radio  pulse  has  a  smoother,  rather  than  a  strictly  rectangular, 
envelope,  which,  in  several  instances,  is  shaped  approximately  like  a  gaussian 
curve  of  the  type  (Figure  2.19) 


(2.59) 

This  circumstance  brings  a  quasi-optimum  filter  even  closer  to  an  optimum 
filter. 

Actually,  a  pulse  of  the  type  (2.59)  has  frequency  spectrum 


_  <**-«»»>* 

1 5  (/«)!*»  Be  ««  .  (2.60) 

Therefore,  frequency  characteristic  (2.58)  is  optimum  for  such  a  filter  (given 
appropriate  factor  a  selection,  i.  e.,  the  width  of  the  gaussian  curve).  Conse¬ 
quently,  if  the  shape  of  a  quasi-optimum  filter  frequency  characteristic  approximates 
a  gaussian  curve,  while  the  radio  pulse  envelope  also  approximates  a  gaussian 
curve,  then  a  quasi-optimum  filter  essentially  coincides  with  an  optimum  filter. 

It  follows  from  this  that,  for  actual  radio  pulses  and  actual  frequency 
characteristics,  a  quasi-optimum  filter  provides  even  less  loss  compared  to  an 
optimaum  filter  than  is  the  case  examined  above  for  the  rectangular  envelope  and 
rectangular  frequency  characteristic,  i.  e.,  the  resultant  loss  in  required  signal 
energy  is  significantly  less  than  1  dB.  In  addition,  the  deviation  of  bandwidth 
P  from  its  optimal  value  in  the  case  of  real  pulses  and  real  frequency  characteristics 
manifests  itself  considerably  less  than  in  the  case  examined  above. 

Thus,  in  the  case  of  a  single  radio  pulse,  a  quasi-optimum  filter  provides 
results  almost  identical  to  those  for  an  optimum  filter  and  there  is  no  requirement 
to  build  strictly  optimum  filters.  Selection  of  quasi-optimum  filter  bandwith 
is  not  very  critical  here.  Evidently,  these  conclusions  also  are  valid  for  a 
single  video  pulse. 


In  the  case  of  a  periodic  radio  pulse  (or  video  pulse)  train,  the  loss  /50 
when  a  quasi-optimum  rather  than  an  optimum  filter  is  used  is  great  and  the  magnitude 
of  this  loss  increases  with  an  increase  in  the  number  n  of  train  pulses. 

The  magnitude  of  the  loss  also  will  strive  towards  infinity  when  n— *>00. 


Actually,  when  n  — *"CD,  the  optimal  frequency  characteristic  will  comprise 
discrete  infinitelv-fine  lines  (Figure  2.14b)'  at  a  time  when  the  frequency  char¬ 
acteristic  of  a  quasi-optimum  filter  (Figure  2.18)  is  continuous.  Therefore, 
the  effective  area  of  the  frequency  characteristic  square  I  /C(/o>)  |  *  determining 
the  power  of  the  noise  at  output  is  an  infinite  number  of  times  greater  for  a 
quasi-optimal  frequency  characteristic  than  for  an  optimum  filter  frequency 
characteristic. 


The  smaller  the  number  n  of  train  pulses,  the  greater  the  area  of  the  optimal 
frequency  characteristic  teeth  (Figure  2.15)  and,  consequently,  the  less  the 
difference  between  optimal  and  quasi-optimal  frequency  characteristics. 

In  the  case  of  a  periodic  pulse  train,  an  optimum  filter  will,  in  essence, 
comprise  two  units — filter  Ki(/W).  optimum  for  a  single  pulse  and  comb  filter 

/(,(/«>)  (Figure  2.10).  It  follows  from  what  has  been  said  that  replacement  of 
unit  Kt(j<»)  by  a  quasi-optimum  filter  in  a  majority  of  actual  cases  is  permissible 
since  it  will  not  lead  to  significant  signal- to-noise  ratio  loss. 


Replacement  of  comb  filter  K% (/«)  by  a  quasi-optimum  filter  (with  a  continuous 
frequency  characteristic)  leads  to  significant  loss,  radically  rising  with  an 
increase  in  the  number  n  of  train  pulses. 


2.5  Comments  on  Communications  and  Differences  Between  Optimum  Filters  and 
Optimum  Receivers 


In  conclusion,  some  comments  are  required  concerning  communications  and 
differences  in  formulation  of  the  problem  of  the  optimum  filter  and  the  optimum 
receiver. 
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The  problem  posed  will  concern  best  reproduction  of  message  x  in  the  case 
of  the  optimum  receiver* 

As  a  modulated  signal  passes  through  a  receiver,  oscillation  u  (t)  in  the 

C 

ideal  case  and  in  the  absence  of  noise  also  must  be  converted  into  message  x, 
i.  e.,  materially  change  its  shape.  Consequently,  in  the  case  of  a  receiver, 
even  a  very  powerful  change  ("distort ion")  in  signal  ux(t)  shape  still  does  not 
give  witness  to  distortions  of  message  x  this  signal  carries.  Thus,  for  example, 
if  the  signal  comprises  a  train  of  short-duration  pulses  modulated  with  respect 
to  amplitude  by  low-frequency  message  x(t),  then  such  a  signal  has  a  very  broad 
spectrum  and,  during  passage  through  a  relatively-narrowband  resonant  amplifier  /51 
of  high  frequency,  signal  ux(t)  shape  will  be  distorted  very  strongly.  However, 
the  law  'of  pulse  amplitude  modulation  will  remain  essentially  unchanged  here  and, 
following  detection,  may  reproduce  message  x(t)  almost  without  distortions.  It 
then  follows  that,  in  the  case  of  modulated  signals  (i.  e.,  in  the  case  where 
there  is  a  requirement  for  best  possible  reproduction  of  only  one  or  several  signal 
parameters,  rather  than  the  entire  signal),  the  optimum  linear  filters  examined 
in  §2.2  of  this  chapter  may  not  beet  solve  the  problem,  and  sometimes  even  are 
completely  unacceptable. 

We  will  compare  the  fallowing  two  cases  as  an  illustration. 

1.  The  requirement  is  to  reproduce  signal  ux(t)  with  minimum  mean  square 
error. 

2.  The  requirement  is  to  reproduce  message  x  carried  by  signal  ux(t)  with 
minimum  mean  square  error. 

In  both  instances,  let  the  signal  comprise  a  train  of  amplitude-modulated 
short-duration  radio  pulses,  while  noise  is  additive  normal  white  noise. 

We  assume  that  noise  intensity  is  slight  compared  with  signal  intensity. 

Then,  in  the  first  case,  when  any  change  in  signal  ux(t)  shape  is  a  distortion, 
it  is  advisible  to  use  a  filter  with  a  very  broad  bandwidth. 

In  the  second  case,  as  will  be  shown  in  §  9.1  and  elsewhere,  the  optimum 


filter  often  boils  down  to  a  linear  filter  and  subsequent  detector.  Here,  the 
optimum  linear  filter  is  one  providing  maximum  signal-to-noise  ratio  at  detector 
input,  i.  e.,  a  filter  of  the  type  examined  in  §  2.3.  Such  a  filter  has  bandwidth 
n  ss  1/t0  ,  i.  e. ,  significantly  narrower  than  in  the  preceding  case,  and  it 

causes  very  strong  signal  ux(t)  shape  distortions  (for  example,  it  converts  radio 
pulses  with  a  rectangular  envelope  into  those  with  a  triangular  envelope,  as  shown 
in  §  2.3).  However,  in  spite  of  this,  mean  square  error  in  message  x(t)  reproduction 
may  be  very  slight. 

Consequently,  in  the  case  examined,  the  filter  providing  the  greater  mean 
square  error  in  signal  ux(t)  reproduction  makes  it  possible  to  obtain  a  lesser 
mean  square  error  in  reproduction  of  the  message  this  signal  carries.  For  this 
reason,  for  the  theory  of  optimal  methods  of  reception  of  modulated  signals,  linear 
filters  providing  maximum  signal-to-noise  ratio  at  their  output  have  a  significantly 
greeter  value  than  linear  filters  providing  minimum  mean  square  error  in  signal 
reproduction  at  output. 

Linear  filters  providing  minimum  mean  square  error  are  useful  when  the  require¬ 
ment  is  for  best  reproduction  on  a  noise  background  of  all  signal  u  (t)  voltage, 

C 

rather  than  one  or  several  signal  parameters.  Therefore,  they  mainly  may  be  used 
for  radio  receiving  devices  in  stages  connected  beyond  the  detector. 

Consequently,  in  radio  receiving  devices,  filters  providing  maximum  /52 
signs l-to-noise  ratio  mainly  are  used  in  the  high-frequency  amplifier  (i.  e. , 
ahead  of  the  demodulator),  while  filters  providing  minimum  mean  square  error  are 
used  in  the  low-frequency  amplfier  (i.  a.,  beyond  the  demodulator).  The  advisibility 
of  including  such  filters  in  the  optimum  receiver  is  substantiated  in  more  detail 
in  subsequent  chapters. 
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METHOD  OF  REDUCING  "NON-WHITE"  NOISE  TO  "WHITE"  NOISE* 

3.1  General  Relationships 

The  method  of  "white”  noise  reduction,  i.  e. ,  noise  with  random  power  spectrum 
Sm( ®)  ,  to  "non-white"  noise  turns  out  to  be  useful  in  the  theory  of  optimal 

reception  methods.  This  method  calls  for  signal- plus-noise  y(t)  preliminarily 
to  be  converted  so  that  non-white  noise  within  it  is  converted  to  white  noise. 

The  advisibility  of  doing  so  is  tied  in  with  the  fact  that  finding  the  optimal 
system  for  white  noise  usually  is  a  significantly  easier  problem.  In  addition, 
its  solution  (for  white  noise)  in  many  cases  already  is  known  and  the  new  problem 
thus  boils  down  to  one  already  solved. 

V.  A.  Kotel'nikov  proposed  and  used  a  method  for  reducing  "non-white"  noise 
to  "white"  noise  for  the  first  time  in  1946  to  solve  the  optimum  receiver  problem 
[1].  Subsequently,  it  also  was  used  in  several  other  works  [5,  8,  and  others]. 

We  will  now  examine  this  method. 


*"Non-vhite"  noise  often  is  referred  to  also  as  correlated  noise 


Figure  3.1 

Let  a  combination  of  3ignal  ux(t)  and  noise  arrive  at  receiver  P  input 

(Figure  3.1a): 


MO  +«.(/). 


(3.1) 


where  um( l)  —  non-white  noise  having  known  power  spectrum  5b(«i>)  .  The  requirement 
is  to  construct  receiver  P  in  such  a  way  that  signal  ux(t)  (or  message  x)  will 
be  extracted  in  the  best  (in  a  certain  sense)  manner. 

The  following  artificial  routine  may  be  used  instead  of  direct  solution  of 
the  problem.  We  will  assume  that  combination  y(t)  passes  across  a  linear  four- 
terminal  network  with  transfer  constant  Ki(/®)  (Figure  3.1b)  and  converts  combination 
y(t)  into  y'(t)  so  that  the  non-white  noise  is  converted  to  white  noise: 

if(/)-«',(/)+«'.(0.  (3.2) 

where  —  white  noiaef  i.  e.,  its  power  spectrum  has  the  form 

Sm'  («•)  **  const. 


For  this,  transfer  constant  must  meet  the  condition 


IK,  {/»)!• 


» 


(3.3) 
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where  b  —  random  constant.  Thus,  combination  y'(t)  of  converted  signal  ux(t) 
and  white  noise  reach  receiver  P'  input.  Receiver  P*  selected  is  optimum,  i.  e., 
is  such  that  it  will  extract  signal  ux(t)  (or  message  x)  in  the  best  way  in  the 
identical  sense  that  receiver  P  did  in  the  Figure  3.1a  circuit. 

We  will  demonstrate  that  the  Figure  3.1b  system,  comprising  linear  filter 
and  optimum  receiver  P',  also  insures  identical  quality  extraction  of 
signal  ux(t)  (or  message  x)  from  combination  y(t),  as  did  Figure  3.1a  optimum 
receiver  P. 

The  Figure  3.1b  circuit  differs  from  that  in  Figure  3.1a  by  presence  at  input 
of  a  linear  four-terminal  network  with  transfer  constant  #,(/«»)  .  It  seems 
at  first  glance  that  inclusion  of  this  network  equates  to  placing  a  significant 
constraint  on  the  optimum  system  found  and  must,  as  a  result,  lead  to  a  worse 
result.  However,  in  actuality,  no  such  constraint  will  exist  if  you  assume  that 
desired  optimum  receiver  P'  may  include  a  unit  with  transfer  constant  l/K, (/<»>) 

(Figure  3.1c). 

Actually,  in  this  event,  initial  oscillation  y(t)  is  reestablished  completely 
at  receiver  P"  input  and  receiver  P"  may  provide  results  identical  to  those  of 
receiver  P  in  the  initial  Figure  3.1a  circuit. 

Thus,  we  have  shown  that  the  Figure  3.1b  system  in  principle  may  provide 
results  identical  to  those  the  Figure  3.1a  circuit  provides. 

We  now  will  demonstrate  that  the  Figure  3.1b  system  in  principle  not  /54 
may,  but  actually  does,  provide  such  results.  While  finding  the  optimum  receiver 
P  structure  (Figure  3.1a),  we  start  from  the  optimization  predetermined  mathematically 
from  the  minimum  mean-square-error  criterion,  for  example: 

5*“lT(0  — *(01‘  “min, 

where  x(t)  —  reproduced  message  real  value,  while  T (t)  —  result  obtained  at 
receiver  output.  While  finding  optimum  receiver  P'  (Figure  3.1b),  we  will  start 
with  the  exact  same  optimization  (evidently,  inclusion  of  compensating  four- 
terminal  network  with  transfer  constant  Ki(/m)  does  not  affect  the  form  of  this 
criterion) . 


We  will  assume  that,  as  a  result  of  mathematical  analysis  of  the  structure 
of  optimum  receiver  P',  we  learned  that  the  Figure  3.1b  system  provides  poorer 
results  (greater  mean-square-error  value  71  ,  in  this  case)  than  did  the  Figure 

3.1a  system.  This  denotes  that  receiver  P*  provides  poorer  results  (greater 
&  magnitude  at  its  output)  than  it  may  provide  in  principle  (even  though  it 
was  shown  above  that  it  in  principle  may  provide  just  as  good  results  as  does 
receiver  P).  But,  this  contradicts  receiver  P'  determination  of  optimality,  in 
accordance  with  which  its  structure  must  be  such  that,  of  all  its  possible  structures, 
it  will  provide  the  best  possible  results  at  output,  i.  e.,  minimum  5*  magnitude. 

Therefore,  if  when  finding  the  optimum  receiver  P'  structure  we  start  with 
the  optimization  used  for  the  initial  system  (Figure  3.1a)  and  we  do  not  preclude 
this  possible  structure  from  containing  unit  l//C,(/a>)  ,  then  the  Figure  3.1b 

system  not  only  may,  but  also  must,  provide  the  same  result  as  does  the  Figure 
3.1a  system.* 

Thus,  we  have  demonstrated  that  the  Figure  3.1b  system  provides  the  same 
result  as  does  the  Figure  3.1a  system.  This  denotes  that  the  system  optimum  for 
non-white  noise  with  spectrum  Sa(m)  comprises  a  linear  four-terminal  network 
with  transfer  constant  Ki(/®)  satisfying  relationship  (3.3)  and  receiver  P’ 
optimum  for  combination  y'(t)  comprising  converted  signal  u^t)  and  white  noise. 
Therefore,  if  the  methodology  for  finding  the  receiver  optimum  for  signal-plus-noise 
is  known,  then  the  Figure  3.1b  circuit  makes  it  possible  to  find  the  receiver 
optimum  for  non-white  noise. 

Extraction  of  "corrective”  unit  K i(/®)  (Figure  3.1a,  b)  from  the  receiver 

P  complement  is  a  purely  intermediate  mathematical  operation  introduced  only  to 
simplify  mathematical  analysis.  Therefore,  there  is  no  requirement  for  the  /55 
physical  realization  of  unit  j  it  suffices  for  only  the  receiver  as  a 

whole  to  be  physically  realizable  (or  approximating  physical  realization). 

This  denotes  that,  if  receiver  P*  (Figure  3.1b)  comprises  a  linear  system 

*As  noted  in  [175,  page  212],  such  a  result,  strictly  speaking,  is  completely 
accurate  only  assuming  unrestricted  processing  time  since,  otherwise,  complete 
"whitening"  of  the  noise  is  impossible. 


with  transfer  constant  A  •((">>.  in  the  input  portion,  then  there  is  a  requirement 
for  physical  realization  only  of  function  .</••»»  ,  but  not  of  each  of  its 

factors  Ki(/w)  and  K *(/«»)  . 

3.2  Using  General  Relationships  to  Find  an  Optimum  Linear  Filter 

Let  combination  y(t)  comprise  signal  u  (t)  with  known  complex  spectrum 

U 

o(/«)  and  noise  «m(0  with  power  spectrum  £„,(<•») .  Then,  unit  (Figure 

3.1b)  has  a  frequency  characteristic  determined  by  formula  (3.3)  and  combination 
y'(t)  will  comprise  signal  uc(t)  and  white  noise  um'(()  . 

I 

Signal  u  (t)  complex  spectrum  S'f/m)  equals 

C 

S'  (/co)  =*  (/«)  S  (/<*),  (3.4) 

while  white  noise  um'(0  has  spectral  density  NqI  where  N #'  «=  (ujpl&f  • 

Let  the  requirement  be  to  find  a  linear  filter  providing  receipt  of  maximum 
ratio  '’m.no  of  signal  voltage  to  mean  square  noise  voltage.  This  signifies  that 
Figure  3.1b  receiver  P*  in  this  case  must  be  a  linear  filter  providing  maximum 
signal-to-noise  ratio  for  a  signal  with  frequency  spectrum  £'(/«>)  and  white 

noise. 

The  solution  to  this  problem  was  presented  in  §  2.3  and  boils  down  to  the 
fact  that  this  filter's  transfer  constant  Kt (/«)  must,  in  accordance  with  formula 
(2.30),  meet  the  condition 


=aS’ •  (/«») e-A*., 


(3.5) 


where  S'*(]w)  —  complex  conjugate  function  with  . 

It  follows  from  (3.4)  that 


S'*  (/<*)=» 


(3.6) 


K j(/w)  —  this  is  the  transfer  constant  of  the  Figure  3.1b  P'  unit. 


Therefore,  the  entire  optimum  system's  transfer  constant  /((/co)  equals 


(3.7) 


It  follows  from  (3.5) — (3-7)  that 


K  (/»)  -  a/C,  (/»)  /C,*  (/«)  S*  (/to)  . 


(/«)  /Ci*  0®)  -I  *,(/«)  |  •-  ; 

®at  W 


consequently, 


where  s^  —  some  constant  magnitude. 


(3.8) 


Formula  (3.8)  also  determines  complex  transfer  constant  /C(/<»)  of  a  linear 
filter  insuring  receipt  of  maximum  signal- to-noise  ratio  in  the  case  of  non-white 
noise  with  power  spectrum  Sm( o»). 

It  follows  from  comparison  of  formulas  (2.30)  and  (3.8)  that,  as  opposed 
to  the  white  noise  formula,  noise  power  spectrum  Sm(®)is  included  in  the  denominator 
of  the  non-white  noise  formula. 

It  follows  from  (3.8)  that  the  optimum  filter  frequency  characteristic  is 
determined  by  the  relationship 


(3.9) 


i.  e.,  this  filter's  frequency  characteristic  is  proportional  to  the  ratio  of 
the  signal  ampl~  •  spectrum  to  the  noise  power  spectrum. 


We  will  examine  as  our  example  a  case  where  signal  and  noise  spectra  take 
the  form  of  gaussian  curves; 


|  S  (/o>)  | 1  =  S,*  ; 

So,  (<i»)“ 


(3.10) 


It  is  known  that  such  spectra,  although  sufficiently  close  to  physical  realization, 
are  not  physically  realizable. 

It  follows  from  (3.9)  and  (3.10)  that  the  optimum  filter's  frequency  charac¬ 
teristic  takes  the  form 


(3.11) 


where  Ao»c/Aw_  —  ratio  of  the  width  of  the  signal  and  noise  spectra. 


It  follows  from  this  formula  that,  given 


A»e  „ _ L_ 

Z5n<  YZ 


(3.12) 


the  optimum  filter's  frequency  characteristic  takes  the  form  of  a  gaussian  curve 
and,  consequently,  is  close  to  being  physically  realizable.  If  condition  (3.12) 
is  not  met,  frequency  characteristic  |K(/®)I  is  very  remote  from  physical  reali¬ 


zation. 


Now,  we  will  explain  what  the  energy  Q'  of  the  converted  signal  is  in  this 


case. 


=  $  |S'(/»)|'d®. 
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Considering  (3.3)  and  (3.4),  we  have 


/57 


Wt&L'-d*. 

Sm(w) 


From  relationships  (3.10)  and  (3.13),  we  obtain 


(3.13) 


(3.14) 


Where  A<oe/h<an  >  lf  energy  Q'  will  become  infinitely-great  or  imaginary  and  the 
solution  found  becomes  senseless;  therefore,  it  must  be 


'•  (3.15) 

However,  it  follows  from  comparison  of  relationships  (3.12)  and  (3.15)  that 
condition  (3.12)  is  more  rigid  than  (3.15).  This  signifies  that  the  validity 
of  the  solution  found  is  constrained  by  the  fact  that  the  solution  found  is  very 
far  from  physical  realization,  rather  than  by  the  fact  that  energy  Q'  will  become 
infinitely-great  or  imaginary. 

Thus,  in  the  case  of  spectra  of  the  (3.10)  type,  optimum  solution  (3.11) 
found  makes  sense  only  when  condition  (3.12)  is  met. 

If  the  noise  spectrum  is  narrow  in  comparison  with  the  signal,  the  solution 
found  turns  out  to  be  inapplicable.  However,  in  this  case,  the  optimum  solution 
is  evident — it  suffices  to  connect  a  linear  trap  with  trap  band  equalling  the 
noise  spectrum  width  (the  latter  here  is  understood  to  mean  the  frequency  region 
outside  of  which  noise  spectrum  energy  is  infinitesimally-small)  in  order  to  obtain 
maximum  signal-to-noise  ratio  at  receiver  output.  Then,  noise  voltage  essentially 
will  be  absent  at  filter  output  and  signal-to-noise  ratio  will  be  very  great. 


3.3  Conclusions 


The  problem  of  optimum  signal  u  (t)  (or  message  x  this  signal  carries)  reception 
an  a  background  of  additive  non-white  noise  with  power  spectrum  Sm(to)  boils  down 

f 

to  finding  receiver  P'  optimum  (in  the  same  sense)  for  converted  signal  ux(t) 
and  white  noise. 

The  converted  signal  is  obtained  from  the  initial  signal  with  passage  through 
a  linear  filter  with  transfer  constant  *i(/w)  ,  whose  modulus  is  determined  by 
relationship 


l*i  (Ml* 


_  b 
Sm  (o>) 


Desired  optimum  receiver  P  for  signal  ux(t)  will  comprise  receiver  P'  and 
*  linear  filter  with  transfer  constant  *,(/»»)  preceding  it. 

Methods  obtained  for  white  noise  using  the  aforementioned  approach  in  many 
cases  may  be  generalized  relatively  simply  in  the  case  of  noise  with  an  essertially- 
random  power  spectrum.  Therefore,  subsequent  material  is  presented  mainly  as 
it  relates  to  white  noise. 
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PART  TWO 
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OPTIMAL  RECEPTION  OF  PRECISELY-KNOWN  SIGNALS  (KOTEL'NIKOV  THEORY  OF  POTENTIAL 
NOISE  IMMUNITY) 

CHAPTER  FOUR 

GENERAL  RELATIONSHIPS 

4.1  Problem  Formulation 

Let  the  signal-noise  sum  reach  receiver  input  (Figure  1.1) 

-MO  +«»(*).  (4a) 

where  ux(t)  —  precisely-known  signal,  i.  e.f  a  signal  whose  only  unknown  parameter 
is  desired  message  x,  while  «m(Q  —  noise. 

Message  x  at  the  point  of  reception  is  considered  a  random  variable  or  random 
time  function  with  known  a  priori  distribution  P(x).  Noise  distribution  law 
also  is  assumed  to  be  known.  The  receiver  analyzes  oscillation  y(t)  during  pre¬ 
determined  (finite)  time  interval  T  (Figure  4.1)  and,  based  on  this  analysis, 
must  reproduce  message  x  in  the  best  possible  way  (in  the  sense  indicated  below). 
Since  the  signal  is  distorted  by  random  noise  «to(/)  ,  while  analysis  time  T  is 


finite,  then  there  is  no  receiver  in  principle  that  may  reproduce  message  x  completely 
accurately,  with  complete  reliability.  There  always  will  be  a  certain  probable 
error. 

Consequently,  the  most  one  can  expect  of  a  receiver  under  such  conditions 
is  to  determine  the  probability  of  a  particular  value  (or  of  a  particular  realization) 
of  message  x  for  a  given  realization  of  total  oscillation  y(t)  arriving  at  receiver 
input. 


Mathematically,  this  denotes  that,  based  on  analysis  of  total  oscillation 
y(t),  the  ideal  receiver  must  compute  distribution  Py(x)  for  all  possible  values 
(or  realizations)  of  message  x  for  given  realization  y(t)  (given  realization  y(t) 
always  is  understood  to  mean  that  form  of  oscillation  y(t)  which  it  has  in  given 
observation  cycle  T) . 

Py(x)  is  a  distribution  of  a  random  variable  if  message  x  is  a  discrete  /60 
random  magnitude;  if  x  is  an  analog  random  magnitude  (or  a  time  function),  then 
Py(x)  is  a  probability  density  (see  Figure  4.2,  for  example). 

In  future,  for  brevity  we  will  call  Py(x)  a  distribution  of  a  random  variable 
in  all  cases,  understanding  however  that,  in  all  cases  where  x  is  an  analog  random 
magnitude  (or  a  time  function),  Py(x)  is  a  probability  density. 

Distribution  Py(x)  is  referred  to  as  a  posteriori  (empirical)  since  it  may 
be  found  only  as  a  result  of  analysis  of  oscillation  y(t)  realization.  Distribution 
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py(x)  differs  in  principle  thereby  from  a  priori  distribution  P(x),  which  is  assumed 
to  be  known  beforehand,  i.  e.,  prior  to  analysis  of  oscillation  y(t). 

Distribution  Py(x)  often  is  referred  to  also  as  an  inverse  probability 
distribution  since  it  shows  what  the  probabilities  are  of  certain  values  of  cause 
x  if  effect  y  brought  about  by  this  cause  is  known. 

In  future,  for  brevity  we  will  call  Py(x)  an  inverse  probability  distribution. 

Thus,  the  best  that  a  receiver  in  principle  can  offer  on  the  basis  of  analysis 
of  realization  y(t)  is  to  compute  distribution  Py(x)  of  message  x  inverse  proba¬ 
bilities  for  all  possible  values  of  these  messages  (see  Figure  4.2,  for  example). 


A  decision  then  must  be  made  from  analysis  of  the  distribution  type  as  to 
what  will  be  the  value  x  of  a  transmitted  message.  This  decision  may  be  made 
by  an  operator  (observing  the  Figure  4.2  Py(x)  distribution  on  an  oscillograph 
screen,  for  example)  or  can  be  made  by  the  receiver  itself.  In  the  latter  case, 
a  rule  must  be  made  for  the  receiver  so  that  it  will  make  its  decision  based 
upon  analysis  of  distribution  Py(x).  One  such  rule  may  be  the  so-called  maximum 
inverse  probability  principle. 


The  assumption  in  this  event  is  that 


y  = 


(4.2) 


where  xun  —  most  probable  message  value,  i.  e.,  that 
probability  Py(x)  has  the  greatest  value  (Figure  4.2). 
case,  the  shape  of  curve  Py(x)  for  various  realizations 
the  most  probable  value,  may  differ  (Figure  4.2).  This 
by  introduction  of  index  y  into  designation  xm  . 


value  at  which  inverse 
In  the  general  /61 
of  y(t)  and,  consequently, 
circumstance  is  underscored 


Consequently,  when  the  maximum  inverse  probability  principle  is  used,  the 
receiver  issues  that  message  value  ^  most  probable  for  a  given  y(t)  realization. 


Another  decision  rule  also  may  be  established  for  the  receiver.  For  example, 


it  is  possible  to  require  the  receiver  to  compute  abcissa  x^  of  the  center  of 
"gravity”  of  the  area  captured  under  curve  Py (x)  (Figure  4.2)  and  supply 


Y  =  *va. 


(4.3) 


Kotel'nikov  demonstrated  that,  given  known  conditions,  such  a  rule  insures 
minimum  message  reproduction  mean  square  error. 

It  is  possible,  in  the  general  case,  to  propose  an  even  infinite  number  of 
rules  acfc/isible  for  this  or  that  reason  by  which  the  receiver  must  make  a  decision. 
Each  such  rule  may  be  considered  an  appropriate  receiver  optimization. 


It  will  be  shown  in  Chapter  19  that,  in  several  instances,  different  rules 
will  lead  to  identical  results.  Thus,  for  example,  in  certain  instances,  the 
maximum  inverse  probability  principle  also  results  in  minimum  mean  square  error, 
i.  e. ,  rules  (4.2)  and  (4.3)  may  provide  identical  results. 

However,  in  the  general  case,  different  optimum  receiver  structures  and 
properties  correspond  to  different  optimizations. 


Kotel'nikov  considers  optimum  that  receiver  operating  on  the  maximum  inverse 
probability  principle,  i.  e.,  in  accordance  with  rule  (4.2).  This  principle  is 
one  of  the  simplest  and  most  natural  and,  in  many  cases,  turns  out  to  be  most 
advisible. 

4.2  Computation  of  Message  Inverse  Probabilities 

Inverse  probability  Py(x)  may  be  found  from  the  following  general  relationships 
of  the  theory  of  probabilities: 


p  (x.  y)-P  {x)  Px  («/)  =  P  (y)  P,  (*). 


(4.4) 


where  P(x,  y)  —  mutual  probability  of  two  random  magnitudes  (or  functions)  x 
and  yj  Px(y)  —  conditional  probability  of  y  for  a  given  xj  P(y)  —  unconditional 
probability  y. 
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It  follows  from  (4.4)  that 


W^PWP.W-  (4.5) 

Since  when  computing  f met ion  P^(x)  we  are  interested  in  the  relationship 
of  this  function  to  x  when  y  is  unchanged,  factor  1/P(y)  in  expression  (4.5)  may 
be  replaced  by  some  constant  k: 

P,(x)-kP(x)Px(y).  (4*6) 

Coefficient  k  may  be  determined  from  normality  condition: 

$*»(*)<**-!,  (4.7) 

AX 

where  —  region  of  all  possible  values  of  x. 

Therefore* 


f  P  (*)  p*  (v)  4*  (4.8) 

Since  a  priori  distribution  P(x)  is  known,  then  only  distribution  Px(y )  remains 
to  be  found. 

It  follows  from  formula  (4.6)  that  one  must  know  the  relationship  of  Px(y) 
to  x  for  a  given  y  in  order  to  determine  function  P  (x).  Distribution  P  (y), 

7  * 

considered  a  function  of  x  for  a  given  y,  is  referred  to  as  a  likelihood  function. 
Px(y)  is  the  probability  (probability  density)  of  y  for  a  given  x.  Since 


*If  x  is  a  random  time  function,  then  expressions  J  and  ^ p p* ** 

are  an  abbreviated  notation  for  integrals  J— Jp»(*i . xn)dxl...dxn  and 

AX 

/  •••  J P(*, *<t)  PM . „„(*)  **i  respectively . 
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function  u  (t)  is  known  precisely  for  a  given  x,  then  it  follows  from  (4.1)  that, 
for  a  given  x,  the  probability  of  realization  y(t)  coincides  with  the  probability 
of  that  realization  of  noise  umlt)  equalling  difference  Ey(t) — ux ( t ) ] .  But,  the 
probability  (probability  density)  of  realization  utn(0  for  given  x  where  x  and  Uu 
are  independent  is  characterized  by  noise  ^m(«ui)  probability  density. 

Therefore,  if  message  x  and  noise  um(l)  statistically  are  independent  (which 
usually  is  the  case),  then, 

P,(y)  =  Wm(y-"*)-  (4.9) 

Kotel'nikov  demonstrated  that  noise  has  the  form  of  normal  white  noise. 

The  distribution  of  such  noise,  in  accordance  with  (1.25),  has  the  following  form: 

, .  -xU10" 

”(TSf?c 

Therefore  /63 


PJy) 


”777  §  ^ 


urn)- 

Substituting  this  expression  into  (4.6),  we  obtain 


(4.10) 


Pw(x)-ktP(x)t 


(4.11) 


where  k^  —  constant  coefficient,  which  will  not  depend  on  x  and  which  may  be 
found  from  normality  condition  (4.7). 


A  receiver  optimum  in  this  sense,  as  Kotel'nikov  assumed,  uses  formula  (4.11) 
to  compute  inverse  probability  P^(x)  for  all  possible  message  x  values  and  supplies 
in  the  form  of  decision  that  value  of  x  at  which  function  P^(x)  has  the  greatest 


value.  It  often  is  convenient  to  represent  expression  (4.11)  in  a  slightly-different 
form  for  practical  realization  of  such  a  computer,  formula  (4.11)  ia  squared 
in  order  to  do  this.  Then  we  will  obtain 

r 

-7b  §  ** <0  4> 

P9(x)  =4.P(jr)e  *°  z~Q*,N‘  e1*.  (4.12) 

where 


Q«=$  “«•<')  rf'  (4.13) 

is  the  energy  of  the  signal  carrying  message  x,  which  in  the  general  case  may 
depend  on  x  (for  instance,  for  amplitude  modulation);  magnitude  £  is  determined 
from  relationship 


(4.14) 

'  0 

r 

-Thl',in41 

Factor  e  0  in  expression  (4.12)  for  a  given  y  will  not  depend  on  x  and 
therefore  may  be  considered  as  constant  magnitudes  as  probability  P^(x)  is  computed. 
Consequently,  we  may  write  expression  (4.12)  in  the  form 

Pt{x)—ktP[x)e~Q*,Nt  c*»,  (4.15) 

where  kj  —  constant  factor,  which  may  be  computed  from  normality  condition  (4.7). 

It  follows  from  expression  (4.15)  that  computation  of  £x  from  formula  /64 
(4.14),  i.  e.,  finding  the  common  correlation  between  received  oscillation  y(t) 
and  anticipated  signal  ux(t),  is  the  main  operation  when  determining  inverse 
probability  Py(x). 

4.3  Optimum  Receiver  Structure 


The  device  computing  the  integral  in  expression  (4.14)  is  called  a  correlator 


Consequently,  the  correlator  is  the  main  optimum  receiver  element.  It  may  be 
realized,  for  example,  using  the  Figure  4.3  schematic  and  comprises  a  multiplier 


Figure  4.3 

and  an  integrator.  Here,  output  voltage  equals 

i 

“MuCO—cjj  y (/)«,(/) dt,  (4.16) 

where  c —  proportionality  factor,  whose  magnitude  is  determined  by  the  properties 
of  the  actual  physical  devices  used  to  carry  out  the  multiplication  and  integration 
operations.  It  follows  from  (4.16)  that  this  coefficient  has  dimension 
[1/(B  times  c)].  Expression  (4.16)  coincides  with  (4.14)  precise  to  a  constant 
coefficient  and  integration  limits. 

It  evidently  is  not  difficult  to  eliminate  the  difference  in  the  constant 
coefficient  by  appropriate  selection  of  the  multiplier  or  integrator  transfer 
constant . 

There  are  several  ways  to  obtain  requisite  integration  limits,  including 
connection  of  a  model  (copy)  of  anticipated  signal  ux(t)  only  for  time  interval 
0  —  T.  Here,  by  moment  t  a  T,  output  voltage  will  attain  requisite  value 

T 

®w«(^)  (4.17) 

0 

In  future,  this  voltage  will  be  retained  unchanged  (for  the  ideal  integrator). 
Therefore,  there  is  a  requirement  to  bring  the  integrator  output  to  zero  prior 
to  conducting  a  new  test  for  message  x  extraction. 


The  linear  filter  described  in  §  2.3,  Batched  with  the  anticipated  signal, 
aay  be  used  instead  of  a  correlator  if  8ignal  ux(t)  lasts  a  finite  time  /65 
interval  (0  —  T)  (which  usually  is  the  case) . 

Actually,  if  oscillation  y(t)  arrives  at  linear  filter  input,  then  the  voltage 
at  its  output  at  soaent  t  ia 


i 

«•*.*  (0  -  J  9  (r)  «!(/—»)*. 


where  n(0  —  filter  impulse  transient  characteristic. 

If  the  filter  is  optimum  for  signsl  ux(t)  (in  the  signal-to-noise  sense), 
then  its  characteristic  satisfies  relationship  (2.37): 

Since,  according  to  the  assumption,  signal  ux(t)  disappears  at  moment  t  s  T, 
then  one  may  assume  tQ  =  T,  i.  e. , 

r\(t)-aux(T—f). 

Therefore,  the  oscillation  at  the  output  of  such  a  filter  equals; 

aiw  (/)-«  $  y{z)ut {T—t+z) dt. 

—  as 

Since,  according  to  the  assumption,  there  is  no  signal  also  whan  t  0, 
than  ux(T— t  +  z)  s  0  where  T— t  +  z  0,  i.  e.,  where  z  <.  t—T ;  therefore 

i 

U»M%  (*)-«  \  y(?)  0  dz. 

lit 


At  moment  t  *  T,  we  obtain 


A  comparison  of  expressions  (4.17)  and  (4.18)  shows  that  they  coincide  with 
precision  to  a  constant  factor.  Consequently,  an  optimum  linear  filter  matched 
with  anticipated  signal  u^(t)  may  be  used  instead  of  a  correlator  (one  should 
assume  that  tQ  =  T  during  synthesis  of  such  a  filter).  Oscillation  y(t)  is  supplied 
to  filter  input  and  the  magnitude  of  oscillation  «aUs(0  at  filter  output  at  moment 
t  =  T  is  measured  (clipped). 

In  spite  of  the  fact  that,  given  the  conditions  enumerated  above,  a  matched 
filter  and  correlator  at  moment  t  =  T  supply  identical  results,  they  differ  greatly 
where  their  remaining  properties  are  concerned.  In  particular,  when  a  correlator 
is  used  (Figure  4.3),  there  is  the  requirement  at  the  point  of  reception  to  generate 
voltage  u^(t)  (signal  copy)  corresponding  to  the  anticipated  signal.  As  opposed 
to  this,  a  matched  filter  is  a  passive  system  not  requiring  generation  of  /66 
oscillation  u^(t).  It  is  simpler  to  realize  a  correlator  for  certain  signal  types 
(phase-shift,  for  instance),  while  a  matched  filter  is  simpler  for  others  (phase- 
modulated,  for  example). 

In  several  cases,  it  is  advisible  to  use  a  combination  of  correlator  and 
matched  filter,  i.  e.,  to  build  a  so-called  correlation-filtration  device.  The 
possibility  of  such  a  device  is  based  on  the  following  [129], 

It  often  is  possible  to  represent  signal  ux(t)  in  the  form  of  the  product 
of  two  known  time  functions,  i.  e. ,  to  assume 

".(0=/,  (/>/,(/)-  (4.19) 

For  example,  if  the  signal  is  a  segment  of  a  sinusoid  with  duration  T,  one  may 
assume 

/,(/)  =  £/  sin  ((M  + $), 

.  |l,  where  /=0-7\  (4.20) 

It  (0  = 

outside  these  limits 

i.  e.,  to  consider  that  f^(t)  —  sinusoid  (of  infinite  duration),  while  f2(t) 

—  rectangular  video  pulse. 


Substituting  (4.19)  into  (4.14),  we  obtain 

(OltU)dt. 

N,  J 


(4.21) 


This  expression  may  be  represented  in  the  form 

y‘  (0  U  0)  dt. 


(4.22) 


where 


y’(0~y  (0M0-  (4*23) 

But,  based  on  the  aforementioned,  a  transform  of  the  (4.22)  type  may  be  accomplished 
(precise  to  a  constant  factor)  by  a  linear  filter  matched  with  the  "signal" 

(0  *  </).  (4*24) 

while  transform  (4.23)  may  be  accomplished  by  a  multiplier.  Therefore,  operation 
(4.14)  may  be  accomplished  by  the  device  depicted  in  Figure  4.4,  where  04>’  — 


Figure  4.4 

filter  matched  with  signal  ux'(t).  At  moment  t  =  T,  voltage  at  this  device's 
output  will  be  identical  to  that  at  the  Figure  4.3  output  and  equalling  (precise 
to  a  constant  factor)  the  requisite  £  magnitude. 

The  Figure  4.4  device  is  referred  to  as  a  correlation- filtration  device  since 
it  will  comprise,  along  with  a  matched  filter,  a  multiplier  as  a  component  part 
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of  the  correlator.  Just  like  a  correlator,  the  dev/ice  requires  generation  at 
the  point  of  reception  of  a  "signal  copy" — function  f^(t).  Since  multipliers 
f^(t)  and  f£(t)  usually  are  simpler  functions  than  complete  oscillation  of  /67 
signal  ux(t),  use  of  a  correlation-filtration  device  often  makes  it  possible  to 
simplify  considerably  the  "signal  copy"  generator  (compared  to  a  purely  correlation 
device)  and  matched  filter  (compared  to  a  purely  filtration  device). 

For  instance,  if  signal  u  (t)  with  duration  T  comprises  a  periodic  train 
of  unmodulated  radio  pulses  and  the  selected  envelope  of  this  train  is  f^(t), 
then  the  Figure  4.4  schematic  will  have  the  following  structure:  "signal  copy" 
f^(t)  —  periodic  rectangular  video  pulse  train,  while  00'  —  linear  filter  matched 
with  a  sinusoidal  signal  with  duration  T. 

The  "signal  copy"  in  the  Figure  4.4  schematic  will  be  a  sinusoid,  while  filter 
00' must  be  matched  with  the  video  pulse  train  if  rf  (sinusoidal)  occupation, 
rather  than  radio  pulse  train  envelope,  is  selected  as  function  f^(t). 

It  follows  from  what  has  been  said  that  computation  of  magnitude  £  v.  for 
any  one  specific  type  of  signal  ux(t),  i*  8.,  for  one  message  x  value,  does  not 
present  significant  difficulties.  However,  to  find  distribution  Py(x),  magnitude 
£  v  must  be  determined  for  all,  rather  than  for  one,  possible  message  x  values 
(or  all  possible  realizations)  which,  in  the  general  case,  presents  major  diffi¬ 
culties.  These  difficulties  are  decreased  considerably  in  the  case  of  discrete 
messages,  when  x  may  have  only  a  finite  number  m  of  discrete  values.  Therefore, 
in  the  case  of  analog  messages,  message  digitization  (quantization)  often  is  used 
to  simplify  practical  computational  device  realization. 

It  often  is  advisible  to  determine  not  function  P^(x)  itself,  but  its  logarithm 
In  Py(x)  to  simplify  practical  realization  and  theoretical  computations. 

It  follows  from  (4.15)  that 


In  P,  (z)  -  In  *,  +  In  P  (*)--£-  + 


(4.25) 


Since  the  greatest  In  P  (x)  and  P  (x)  values  coincide  when  x  changes,  the 


optimum  receiver  operating  on  the  maximum  inverse  probability  principle  must  select 
that  value  of  x  for  which  In  Py(x)  has  the  greatest  (in  the  algebraic  sense)  value. 

Coefficient  l<£  does  not  impact  upon  the  form  of  curves  (4.15)  and  (4.25);  /68 
therefore,  for  simplicity  In  may  be  discarded. 

Then,  instead  of  (4.25),  we  obtain 


In />„(*)  =  In />(*)_?«  (4.25a) 

Nt 

Expression  (4.25a)  is  significantly  simpler  than  (4.15)  for  computations. 

The  resultant  relationships  are  valid,  both  for  discrete  messages  and  for 
individual  analog  message  values  and  for  oscillations  x(t).  A  more  detailed 
examination  of  these  three  cases,  in  the  order  that  Kotel'nikov  approached  them 
[1],  will  be  provided  in  the  following  section. 


94 


AD-8128  899  THEORY  OF  OPTIMUM  RADIO  RECEPTION  METHODS  IN  RANDOM  2/7 

NOISE(U)  FOREIGN  TECHNOLOGY  DIV  WRIGHT-PATTERSON  AFB  OH 
L  S  GUTKIN  24  SEP  82  FTD-ID<RS)T-0784-82 


UNCLASSIFIED  F/G  9/4  NL 


s*tKL3Ki*Efli 


CHAPTER  FIVE 

RECEPTION  OF  DISCRETE  MESSAGES 

5.1  General  Case 

Let  message  x  have  one  and  only  one  of  possible  values 

Jf§»  JTjv  Xfi  ...»  i 

where  xQ  —  zero  value  corresponding  to  the  absence  of  any  message  (spacing). 

A  priori  probabilities  of  the  presence  (sample)  of  these  messages  equals, 
respectively 

P  (■*•)»  P  ix P(xk)>  ••• »  P(xm)> 


while 


S  P(x0-1. 

kmO 


(5.1) 


Each  value  x.  of  the  message  has  its  corresponding  value  u  (t)  of  the  signal. 
k  xfc 


sannraraEBSii 


LV.W.V. 


Therefore,  one  and  only  one  of  the  following  possible  signals  must  be  present 
at  receiver  input  during  interval  (0,  T): 

«*,(0 . uXh(i), ....  H,m (/).  (5.2) 

In  future,  for  simplicity  magnitude  x  is  dropped  from  the  indices,  i.  e., 
instead  of  (5.2),  we  write 

Mi(0«  ....  uk  (/),  ...  um(t).  (5.2a) 

Zero  message  (xQ  =  0)  may  be  transmitted,  either  by  a  zero  signal  (in  this 
case,  uQ(t)  f  0  —  passive  spacing),  or  by  any  predetermined  known  signal  /69 
(then  u  £  0  —  active  spacing);  meanwhile  it  is  known  beforehand  which  of  these 
two  cases  must  occur. 

Signals  u^t)  are  assumed  to  be  precisely  known,  i.  e.,  the  shape  of  all 
signals  of  the  (5.2a)  type  is  known  precisely.  The  only  unknown  subject  to  deter¬ 
mination  is  exactly  which  of  these  signals  was  present  at  receiver  input  during 
interval  (0,  T). 

Consequently,  completely-determined  signal  uk(t)  corresponds  to  each  message 
xk  and,  on  the  other  hand,  if  signal  uk(t)  is  known,  then  message  xk  corresponding 
to  it  is  determined  unambiguously  in  the  same  way.  Therefore,  a  priori  probabilities 
P(uk)  of  the  presence  of  signals  u^t)  equal  the  a  priori  probabilities  P(xk) 
of  the  corresponding  messages: 

P  («»)  ~  P  (**)•  (5.3) 


fUt)  fut)  fu0)  (utt)  (um) 


Figure  5.1 


In  this  event,  a  priori  distribution  P(x)  has  the  form  depicted  in  Figure 

5.1. 

Using  analysis  of  realization  y(t)  as  its  basis,  the  receiver  must  determine 
exactly  which  of  the  m  +  1  possible  values  message  x  had  in  the  interval  (0,  T) 
examined,  i.  e.,  which  exactly  of  the  m  +  1  possible  signals  existed  at  that  time 
at  receiver  input. 

Since  signal  determination  thereby  denotes  determination  also  of  message 
xk  corresponding  to  it,  then  for  brevity  we  will  discuss  only  signals  in  the  future. 

Here,  general  relationships  (4.12)— -(4.14),  and  (4.25a)  may  be  written  in 
the  form 


P»  ("*)  =  4,  P  (uh)  e1*; 

(5.4) 

In  P9  («*)•■  In  P  («»)——-+ !»; 

(5.4a) 

r 

(5.5) 

0 

l*-~ly(0«Kit)dt. 

(5.6) 

Since  the  receiver  operates  on  the  maximum  inverse  probability  principle, 
then  it  reproduces  each  time  that  signal  uk  for  which  inverse  probability  Py(uk) 
turns  out  to  be  the  greatest.  Here,  various  kinds  of  error*  ere  possible  and 
composite  error  probability  P«*  equals 

P^-P  («,)P».  +  P  [Hi)  P„,  (+  Hi)  + ...  + 

+  P  («*)  P.*  (+  «*)  +  ...  +P  (Mm)  P*m  H*  »m) . 

(5.7) 

where  —  conditional  probability  that,  given  signal  uk  at  input,  / 70 

the  receiver  will  reproduce  some  other  signal  (it  is  m import ant  which  one), 
i.  e.,  an  error  occurs. 
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It  often  happens  to  be  simpler  to  compute  composite  correct  reproduction 
probability  ^apaa  ,  rather  than  composite  error  probability  Pom  s 

P npa*  =  P  (««)  Pu,  («.)  +  P  («i)  Put  («j)  +  +  P  («m)  («m).  (5.8) 

where  —  probability  (conditional)  that  the  receiver  will  reproduce  signal 

u^  when  signal  u^  actually  is  at  input. 

It  is  evident  that 

^iipan  “  ^  Ponr  (5.9) 

It  is  easy  to  become  convinced  that  a  receiver  operating  on  the  maximum  inverse 
probability  principle  provides  a  minimum  composite  error  probability  value. 

Actually,  Py(uk)  is  the  probability  that  signal  uk  will  occur  for  a  given  y. 

Each  time,  the  optimum  receiver  at  output  supplies  that  signal  u^,  for  which 
the  probability  is  the  greatest  for  a  given  y.  Consequently,  signal  selection 
error  probability  for  a  given  y  is  the  least.  This  occurs  for  any  y(t)  realization. 

Consequently,  for  every  y(t)  realization,  the  receiver  supplies  the  response 
with  the  minimum  error  probability.  Therefore,  resultant  composite  error  probability, 
considering  all  possible  oscillation  y(t)  realizations,  must  also  be  the  minimum 
possible  here.  Consequently,  from  the  point  of  view  of  minimum  composite  error 
probability,  the  maximum  inverse  probability  principle  provides  the  best  possible 
results.  However,  the  minimum  composite  error  probability  criterion  for  discrete 
messages  is  not  the  only  possible  and  advisible  one. 

Actually,  given  minimization  of  probability  Pom  (5.7),  errors  of  all  types 
uniformly  are  considered  dangerous.  For  example,  if  signal  u^  was  active  at  input, 
then  reproduction  of  signal  u^  instead  of  ^  is  considered  just  as  undesirable 
("dangerous")  as  reproduction  of  signal  u^  or  u^  and  so  on.  However,  in  several 
cases  (in  radar,  for  instance),  different  types  of  errors  have  a  different  "danger” 
and  there  is  a  need  to  insure  the  minimum,  not  of  expression  (5.7),  but  of  a 
more-complex  expression,  which  considers  the  "weicjht"  ("danger")  of  certain  errors. 


In  this  case,  appropriate  corrections  (examined  in  Part  Four)  must  be  introduced 
into  the  maximum  inverse  probability  principle. 

This  chapter  examines  only  the  simplest  case  where  a  receiver  operates  on 
the  maximum  inverse  probability  principle  in  accordance  with  the  work  of  Kotel'nikov. 


Here,  the  structure  of  the  optimum  receiver  must,  in  accordance  with  formulas 
(5.4)— (5.6),  take  the  form  depicted  in  Figure  5.2. 


The  receiver  will  comprise  computers  Bn,  Bt . Bm  ,  to  the  inputs  of  /71 

which  oscillation  y(t)  and  anticipated  signals  uQ,  u^,  .  .  . ,  uffl  are  supplied. 
Computed  inverse  probabilities  Py(uQ),  Py(u^),  •  •  •»  Py(um)  are  in 

comparison  circuit  CC  and  a  determination  is  made  as  to  which  has  the  highest 
value;  the  signal  (or  message)  corresponding  to  it  also  is  supplied  in  the  form 
of  output  magnitude  u*Mt  • 


Since  inverse  probabilities  Py(uk)  always  are  positive,  then  they  may  be 
reflected  by  the  corresponding  values  of  positive  constant  voltage.  Therefore, 
circuit  CC  may  be  realized,  for  instance,  with  the  aid  of  a  cathode-ray  tube  [CRT] 
so  that  a  specific  area  on  the  CRT  screen  will  correspond  to  each  channel  (each 
computer  £*  ) .  Here,  the  brightest  area  will  be  the  one  corresponding  to  the 
channel  with  the  highest  value  Py(uk).  Then,  selection  of  the  brightest  area 
suffices  for  signal  u^  selection. 
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Each  computer  A*  may  have  the  structure  depicted  in  Figure  5.3 ,  i.  e. ,  comprise 
a  correlator  computing  from  formula  (5.6) ,  summing  stage  CK  ,  and  stage  SK 


S 


Figure  5.3 


with  an  exponential  characteristic,  which  converts  In  P  (u.  )  into  Pu(u.  )}  here, 

y  k  y  k- 

stage  9K  in  this  simplest  case,  when  the  task  is  only  to  determine  which  of  the 

P  (u,  )  values  is  the  greatest,  is  not  required  in  principle  since  this  problem 
y  k 

may  be  solved  by  comparison,  not  of  the  PyCu^)  magnitudes  themselves,  but  through 

comparison  of  their  logarithms,  In  P  (u.  ). 

y  k 

It  is  evident  from  Figure  5.3  that  magnitudes,  depending  on  a  priori 
probabilities  P(uk)  and  anticipated  signal  energies  Qk,  play  the  role  of  biases 
in  the  optimum  receiver  circuit. 


5.2  Binary  Detection 

We  now  will  examine  in  more  detail  the  simplest  case,  called  binary  detection. 

In  this  case,  the  signal  may  have  only  two  values,  one  of  which  identical 
with  zero,  i.  e., 

</,(/)  mm  O;  m,  (0=M0- 


The  problem  boils  down  here,  in  essence,  to  signal  detection,  i.  e. ,  /72 

to  clarification  of  whether  or  not  signal  u  (t)  (and  noise)  is  present  at  receiver 

C 

input  or  a  signal  is  absent  (i.  e.,  only  noise  is  present). 


To  solve  the  problem  of  whether  or  not  there  is  a  signal,  the  receiver  must 
compere  inverse  probabilities  Py(uc)  and  Py(0),  where,  in  accordance  with 
formula  (5.4) , 


In  P,  («*„)  =  In +|, 
In  P„(0)  —  In  P(0). 


(5.10) 


Here  P(uc)  and  P(0)  —  a  priori  probabilities  of  signal  presence  and  absence, 
while  Q  —  signal  energy. 

The  receiver  must  supply  the  answer  "yes"  (signal),  if  In  Py(uc)^>ln  Py(0), 
and  the  answer  "no"  (no  signal),  if  In  P  (u  )^[  In  P  (OK 

y  c  ^  y 

Consequently,  the  receiver  must  supply  the  answer  "yes"  when  this  inequality 
is  satisfied 

(5.11a) 


where 


U . 


-In 


P(Q)  ,  Q  . 
P  <«e)  #•  ’ 


(5.11b) 


(5.11c) 


If  inequality  (5.11a)  is  not  satisfied,  the  receiver  must  supply  the  answer 

"no." 


It  follows  from  formulas  (5.11a)  and  (5.11c)  that  the  structure  of  the  optimum 
receiver  for  binary  detection  has  the  form  depicted  in  Figure  5.4.  The  correlator 
computes  £  from  formula  (5.11c) ;  the  £  value  found  is  compared  at  moment 
t  *  T  with  threshold  (constant  negative  bias)  UQ.  If  the  result  is  £  UQ, 
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Figure  5.4.  (a)  —  Correlator. 

i.  e.,  u,ut'  !>0,  the  receiver  supplies  the  answer  "yes"  (signal).  Otherwise, 
it  supplies  the  answer  "no"  (no  signal,  only  noise). 


As  noted  in  §  4.3,  the  correlator  in  several  cases  may  be  replaced  by  a  matched 
linear  filter  or  by  a  correlation- filtration  device.* 


In  the  Figure  5.4  circuit,  correlator  output  and  threshold  bias  are  characterized 
by  d intension less  magnitudes  £  and  Uq  determined  by  relationships  (5.11).  However, 
in  an  actual  circuit,  correlator  output  voltage  (or  that  of  the  matched  linear  /73 
filter  replacing  it)  at  the  moment  of  comparison  with  the  threshold  equals 


*IMX 


(T)-cjy(/)  «e  (/)«//. 


(5.12) 


where  c  —  some  proportionality  coefficient  conveniently  determined  from  the  following 
circumstances.  When  noise  is  absent  (or  in  the  presence  of  a  signal  considerably 
more  intense  than  the  noise),  correlator  (or  matched  filter)  output  voltage  at 
moment  t  =  T  equals 


Mc  rat  CO  =*c  ^  «„  (/)  a9  (0  dt  ~cQ, 


where  Q  —  signal  energy.  Consequently, 


"Technical  advantages  and  shortcomings  of  replacing  a  correlator  with  a  matched 
filter  are  examined  in  [136,  188,  and  others]. 


I 


C  =  Uc 


(5.13) 


It  follows  from  substitution  of  expressions  (5.11)  and  (5.12)  that  threshold 
bias  at  actual  circuit  output  must  equal 


(5.14) 


Sometimes,  it  is  more  convenient  to  determine  parameters  c  and  Uov  from  extant 
noise  voltage  Um  output  value,  rather  then  from  usable  signal  output  voltage 

Since  in  accordance  with  (2.34) 


then,  instead  of  (5.13)  and  (5.14),  one  may  assume 


VSUw  . 
Vtt  ' 


(5.13a 


It  follows  from  (5.llb)  and  (5.14)  that 


(5.14a 


(5.15) 


m 


■  '4*'  'iPr ^  1 


Ill  particular,  if  P(0)  =  P(uJ,  then 


“  o  Ucuux  (‘O* 


i*  0« ,  threshold  bias 
output  voltage. 


must  equal  one-half  the  maximum  value  of  usable  signal  /74 


Too  types  of  errors  are  possible  when  this  type  detector  is  used:  false 
alarms,  i.  e.,  "yea”  when  in  actuality  there  is  no  signal,  and  signal  misses, 

i.  e.,  "no"  when  there  actually  is  a  signal. 

,  % 

We  will  designate  the  probabilities  of  false  alarms  and  signal  misses 
P»t  and  ^np  t  respectively.  Then,  composite  error  probability  equals 


Pm*mp***mmr 


(5.id) 


Since  a  priori  p: 
it  remains  to  compute 


P(0)  and  P(uc)  are  assumed  to  be  known,  then 
error  probabilities  pat  .and  P*a  . 


Initially,  we  will  find  false  alarm  probability  Pat  .  A  false  alarm  means 
nyes"  whan  there  is  no  signal.  Therefore,  the  false  alarm  probability  equals 
the  probability  of  satisfying  inequality  (5.11a)  whan  there  is  no  signal,  i.  e., 
whan 


It  follows  from  (5.11c)  that,  in  this  csss 


(5.17) 


end  the  false  alarm  probability  is  the  probability  that  random  magnitude  £  , 
determined  from  formula  (5.17),  will  exceod  threshold  UQ. 


We  will  find  the  law  of  magnitude  £  distribution.  Qn  the  basis  of  formula 
(1.12),  one  may  write 


hm  (o  —  2  k  'fk  (0  * 

k— I 

«c(0  =  2  “c  *%(*)■ 

*- 1 


(5.18) 


Substituting  these  expressions  into  (5.17)  and  considering  relationship  (1.12c), 


we  obtain 


■  It  M* 


(5.19) 


where  N  =  . 

Since  signal  uQ(t)  is  precisely  known,  then  values  uck  included  in  formula 
(5.19)  are  not  random  magnitudes. 

Ordinates  umh  of  white  noise  «m(0  statistically  are  independent  random 
magnitudes  with  a  normal  law  of  distribution,  zero  average  values,  and  dispersions 
equalling  N  (see  §  1.3). 

Therefore,  terms  Hm»(<W)V)  in  the  sum  of  (5.19)  also  are  independent  /75 
random  magnitudes  with  a  normal  law  of  distribution,  zero  average  values,  and 
dispersions  equalling 


Consequently,  magnitude  £  determined  from  formula  (5.19)  also  has  a  normal 
law  of  distribution  with  zero  average  values  and  with  dispersion  equalling 
the  sum  of  the  dispersions  n  of  the  terms,  i.  e., 


P{  l)es 

j/  2nO| 


(5.20) 


V  v.  --  -•#  ■  .  „ 


But,  it  follows  from  (1.12d)  that 


n  T 

2  ulk  =*2/^  u*(t)dt  =  2f,Q; 

k—  I  o 


therefore 


<r?  = 


2 Q 


1  N ,  ' 

The  probability  that  £  will  exceed  threshold  U  equals 

w  O 

_ j _ 


y. 


(5.21) 


i.  e., 


where 


(5.22) 


■„  P(Q)  ,  g 

_ v*  _  /*  («e>  ^  -V, 

®*  "  ~  ~ - 


/ 


l/lQ 
r  a* 


We  now  will  find  signal  miss  probability  Pn p  . 


(5.23) 


It  follows  from  Figure  5.4  that  there  will  be  a  signal  miss  in  those  cases 
when,  given  a  signal  at  input,  i.  e.,  when 


y(t)  —  we  (/)  H' m  ,  U), 


(5.24) 


it  turns  out  that 

l '  uo-  (5.25) 

Therefore,  miss  probability  is  the  probability  that  inequality  (5.25)  /76 

is  satisfied,  in  which  £  is  determined  by  formulas  (5.11c)  and  (5.24),  while 
it  is  precisely  that 

r 

i  =  Jrj*  lu0(t)+um(t)\u6(t)dt. 

Expanding  the  parentheses,  we  obtain 

r 

(5.26) 

The  first  term  in  this  expression  already  has  been  investigated  in  this 

section  [see  formula  (5.17)];  here  we  found  that  it  has  a  normal  law  of  distribution 

with  a  zero  average  value  and  dispersion  2Q/Nq.  Therefore,  magnitude  £  , 

determined  from  relationship  (5.26),  has  a  normal  law  of  distribution  with  average 

value  2Q/N  and  dispersion  2Q/N  also,  i.  e.,  in  this  case 
o  o 


P{1) 


where  is  determined  as  usual  from  formula  (5.21). 


(5.27a) 
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Therefore,  the  miss  probsbility,  i.  e.,  the  probability  that  £  U  , 


equals 


■  rV(‘-?)7’v 

/2jtos  J 


After  appropriate  replacement  of  variables,  we  obtain 


'■-“7 


(5.27b) 


where 


q  p(Q) 

P(«c) 


(5.27c) 


It  is  possible  to  use  formulas  (5.16),  (5.22),  and  (5.27b)  to  compute  the 

dependence  of  error  probabilities  Pota,  PM1  and  Pnp  on  ratio  P(0)/P(u  )  of  the 

c 

a  priori  probabilities  and  signal-to-noise  ratio  Q/Nq. 

However,  in  those  instances  when  permissible  error  probabilities  are  slight 
(do  not  exceed  0,  1),  requisite  signal-to-noise  ratio  Q/Nq  is  so  large  that  /77 
it  is  possible  to  use  an  asymptotic  expansion  of  the  probability  integral 


T srl'-""-", 


(5.28a) 


and  precisely  the  expansion 


i  •-*/* 


(5.28b) 
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Here,  formulas  (5.22)  and  (5.27b)  take  on  the  following  form: 


“T  I'ISa,  "p  I  5^*. 


hence 


(5.29J 


In— - - 0,9  — In a„ 

2  “at 

« In  — ^ - 0,9—  In 


If  and  ,  it  is  possible  to  assume  that 


*-"ib 


and,  considering  expressions  (5.23)  and  (5.27b),  we  obtain 


(5.30c 


(5.30t 


^(l/^7  +  lb)’' 


(5.31) 


P(«»o) 


(5.32) 


Here  and  elsewhere,  q  =  Q/Nq.  Formulas  (5.31)  and  (5.32)  will  become  precise 


pnv  -*■  °- 


Due  to  the  rise  in  magnitudes  and  Pov  ,  the  error  of  expressions  (5.31) 


and  (5.32)  increases.  As  computations  show,  when  /% T  <  <>, I  and  ^np  K  0, 1  ,  instead 
of  (5.31),  it  is  more  advisible  to  use  the  following  corrected  formula: 


,  *  (l/,n_L._M  +  )/ 

The  error  in  determination  of  q  does  not  exceed  0.5  cb  if  and  /78 

Pup  <0,1  .  Where  P„T-+-0  and  Pnp-*-0  ,  the  error  asymptotically  will  strive 

towards  zero.  It  follows  from  formula  (5.16)  and  (5.32)  that 


P  —  Pom  p 

nT  2P( 0)'  2P(«,)  *  <5*33) 

If  a  priori  probabilities  P(0)  and  P(u  )  and  permissible  ccxnposite  error  probsbility 

w 

Pom  are  given,  then  it  is  possible  to  use  formulas  (5.33)  to  compute  permissible 
probabilities  P„ T  and  ^np  and  then  to  use  formula  (5.31a)  to  determine  requisite 
signal-to-noise  ratio  q. 

According  to  problem  conditions,  a  priori  probabilities  P(0)  and  P(u  )  often 

V 

are  unknown.  Here,  formula  (5.16)  of  composite  probability  Pom  becomes  senseless 
and  should  be  provided,  not  by  permissible  magnitude  Poia  ,  but  by  permissible 
conditional  false  alarm  and  signal  miss  probabilities  Pur  and  Pn P  .  Here,  requisite 
signal-to-noise  ratio  q  also  is  determined  from  formula  (5.31a). 

During  derivation  of  formula  (5.31a),  it  was  assumed  that  threshold  bias 
Uq  at  optimum  receiver  output  is  determined  from  formula  (5.11b),  and  it  is  precisely 
that 


If  a  priori  probabilities  P(0)  and  P(u  )  are  unknown  and  we  are  given  directly 

C 

conditional  probabilities  Pn  T  and  Pap  ,  then,  in  accordance  with  expression 
(5.32),  ratio  P(0)/P(u  )  in  this  formula  should  be  replaced  by  P9VIP»r  ,  i.  e., 

C 

one  should  assume 
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Here,  formula  (5.14a)  takes  on  the  following  form: 


U+ 


Vm 


(5.34a) 


However,  for  given  probability  PM,  ,  it  is  possible  to  obtain  a  simpler  formula 
for  determination  of  the  requisite  real  threshold  U ^  . 


Actually,  in  absence  of  a  signal,  voltage  at  moment  t  s  T  at  correlator  output 
(or  at  the  output  of  the  matched  filter  replacing  it)  has  a  normal  law  of  distribution 
with  a  zero  expected  value  and  dispersion  equalling  Um*  .  Therefore, 


I* 


“  ,T,/*t'-duw„  (7) 


vfc  1 
</y. 


e-#,'*<fe. 


Where  P»?<0,1  (this  usually  is  the  case),  it  is  possible  with  sufficient  /?9 
precision  to  assume 


i  • 

where  oi^UtvIUm  ,  Hence,  it  follows  that 

I  _ 

UtP»Um]/r2ln-7L-  (5.34b) 

9  "nr 


(such  a  result,  of  course,  is  obtained  also  from  formula  (5.34a)  if  expression 
(5.31)  is  substituted  into  it  for  q). 


Since  in  accordance  with  (2.34) 


ue  mi  (T)  |  f  2Q 
Um  V  N% 


II  —  ggjgj  ID 

Um  vw  ’ 


and  fomula  (5.34b)  may  be  represented  al80  in  the  following  form: 


U -OsaaSlLi/’ \n-±-  , 
H  Vi  V  Pi it 


(5.34c) 


where  ueBtls(7)  ~  maximum  (peak)  signal  voltage  value  at  correlator  (or  its 
replacement  matched  filter)  output. 

It  follows  from  the  formulas  presented  that,  during  binary  sicyial  detection, 
detection  reliability  (error  probability)  will  not  depend  on  signal  uc(t)  shape 
and  will  depend  only  on  its  energy  Q. 

5.3  Discrimination  of  Two  Non-Zero  Signals 

In  the  preceding  section,  we  examined  a  case  where  one  of  two  possible  signals 
[uQ(t)  and  Ul(t)]  is  identical  with  zero.  It  is  possible  analogously  to  examine 
a  more  common  case  where  both  possible  signals,  u^(t)  and  U2(t),  are  non-zero 
signals.  V.  A.  Kotel'nikov  obtained  the  following  results  for  this  case  [1]. 
Composite  error  probability  Pm  equals 


Pm-Pfat)Pfat  am.  U|)  +  P(«i)P(«i  »M.  «,), 


(5.35) 


where  P(u^)  end  P(u^)  —  a  priori  probabilities  of  the  presence  of  signals  u^ 

and  Uj,  while  P{ u,  am.  w,)  and  Pfaim.it,)  —  conditional  probabilities  of  reception 

of  signal  Uj  instead  of  and  signal  u^  instead  of  respectively.  Here  /BO 


_V  *  '  •.*  %** 


P  (Uj  bm.  «i)  =  V  (a„), 


(5.36) 


*  «  y*  T  ■  it  .  V-  JV*  ™  •  •  • 


v  *w  v.  v  v:  ■ 


where 


(5.28a) 


11  2*  P(llt) 


(5.37) 


1 


T 

** — j*  l«i  (0 — «i  .(01*  dt. 


(5.38) 


It  is  sufficient  in  formulas  (5.36) — (5.38)  to  replace  indices  1  and  2  here 
and  there  to  find  probability  P(ut  bm.  «,)  . 

Analysis  of  these  relationships  demonstrates  that  error  probabilities 
P(u t  bm.  «,).  P(ut  bm.  aj  and  Pm  decrease  as  magnitude  Ct  rises.  But,  it  follows 
from  formula  (5.38)  that  magnitude  C(  is  maximum  if 


(5.39) 


therefore,  relationship  (5.39)  insures  receipt  of  minimum  error  probabilities. 
Here,  signals  u^(t)  and  Uj(t)  have  identical  energy 


where 


Gi-j  «»•(*><«; 


(5.40) 


If  signals  Uj(t)  and  u2(t)  have  identical  energy  (Q2  s  Q^),  but  satisfy  an 
orthogonality  condition  rather  than  condition  (5.39),  i.  e., 

T 

(5.42) 

then,  from  (5.38)  we  obtain: 


a* 


(5.43) 


Finally,  in  the  case  of  binary  detection,  when  one  of  the  signals  (u2,  for 
example)  is  identical  with  zero  (passive  spacing),  from  (5.38)  one  obtains 


«»«-£-.  (5.44) 

2*. 

In  phase-shift  keying  [PSK]  telegraph  communications,  relationship  (5.39)  /81 
occurs,  while  relationship  (5.42)  occurs  during  frequency-shift  key  [FSK]  with 
sufficient  frequency  f^  and  f2  separation;  binary  detection  corresponds  to  telegraph 
transmission  with  passive  spacing. 

Obtaining  identical  signal  reproduction  reliability  in  all  these  cases  (i.  e., 
identical  error  probabilities)  requires  an  identical  parameter  CX  value.  But, 
it  follows  from  comparison  of  formulas  (5.41),  (5.43),  and  (5.44)  that  obtaining 
an  identical  CX  value  requires  signal  energy  Q  greater  by  a  factor  of  2  for  FSK 
and  greater  by  a  factor  of  4  during  passive  spacing  than  during  PSK.  Consequently, 

FSK  provides  a  loss  of  required  energy  greater  by  a  factor  of  2,  and,  for  transmission 
with  passive  spacing,  greater  by  a  factor  of  4  than  is  the  case  for  PSK. 


It  should  be  noted  here,  however,  that,  in  active  spacing  systems  (FSK  and 


PSK,  for  example),  energy  Q  will  be  expended  during  transmission  of  any  of  the 
signals  (u^  or  U2),  while,  in  systems  with  passive  spacing,  energy  is  not  expended 
in  the  spacings.  Therefore,  if  pulses  and  spacings,  for  example,  have  identical 
duration  and  a  priori  probabilities  P(u^)  and  P^),  respectively,  then  the  average 
energy  that  will  be  expended  during  a  large  number  of  tests*,  in  the  case  of  passive 
spacing  equals 

<3cp  nn  =»P(«i)Qrm. 

then,  in  active  spacing  systems,  it  equals 

QcpAn  “QAn; 

therefore 

_^pnn.==p{„l)^illL. 

QepAn  Gaii 

It  follows  from  this  that,  given  P(u^)<^  1,  systems  with  passive  spacing 
are  more  advantageous  than  those  with  active  spacing  (from  the  standpoint  of  expended 
average  energy  Qcp  ) ,  in  spite  of  the  fact  that  Qrm  is  several  times  greater 
than  Q,ui  . 

If  P(u^)  s  0.5,  i.  e.,  pulses  and  spacings  are  equally  probable,  then  from 
the  point  of  view  of  average  energy  Qc p  ,  passive  spacing  systems  and  FSK  systems 
are  equivalent,  while  a  PSK  system  in  comparison  provides  an  energy  saving  greater 
by  a  factor  of  2. 

It  also  is  evident  that  PSK  is  better  than  FSK,  both  from  the  standpoint 
of  each  signal's  energy  Q  and  of  average  energy  Qcp  . 

♦Here,  each  test  will  comprise  determination  of  which  signal  (u^  or  u^)  will 

be  received  during  time  interval  (0,  T),  while  only  one  of  these  signals  may  exist 
simultaneously  in  this  interval. 


5.4  Discrimination  of  m  Orthogonal  Equiprobable  Signals  Having  Identical  Energy  /82 

Calculation  of  receiver  sensitivity  is  very  complicated  in  the  general  case 
of  signals  (or  messages)  with  many  discrete  values.  Therefore,  V.  A.  Kotel'nikov 
only  computed  for  several  frequent  cases,  the  most  significant  being  a  case  of 
discrimination  of  m  orthogonal  equiprobable  signals  having  identical  energy. 

Here,  it  is  assumed  that  the  signal  at  receiver  input  may  have  one  of  m  possible 
ValU68  UjU),  UjCt),  .  .  .,  um(t). 

All  possible  signals  have  identical  energy 

<?i  »<?*  =  •  •=»  Q«-<? 

and  they  satisfy  the  orthogonality  condition,  i.  e., 

T 

J uh (/) u, (/)  dt  =  0  where  1  i  k  (5.45) 

o 

Since  all  signals  have  identical  energy,  then  probability  P(uq)  of  a  zero 
signal  equals  zero.  All  non-zero  signals  are  equally  probable,  i.  e.,  the  a  priori 
probability  of  the  presence  of  each  signal  equals 

P  (uj  =-  P  (u,)  - ...  -  P  (uj  =  J .  (5.46) 

The  most-important  individual  instances  of  orthogonal  signals  are  those  which 
do  not  overlap  in  time  (Figure  5.5)  or  in  frequency  spectrum  (even  though,  generally 
speaking,  those  signals  which  overlap  in  time  and  in  frequency  spectrum,  such 

as  sin  y 1  and  cosy/  ,  also  may  be  orthogonal). 

The  receiver's  job  is  signal  discrimination,  i.  e.,  determination  of  exactly 
which  one  of  the  m  possible  signals  was  present  at  receiver  input  during  observation 
cycle  (0,  T).  Here,  interval  T  is  selected  in  such  a  way  that  it  encompasses 
all  possible  signal  u.  (\)  positions. 


Figure  5.5 


Thus,  for  example,  if  the  possible  orthogonal  signals  take  the  form  depicted 
in  Figure  5.5,  then  the  following  must  be  the  case 

r-mr,  (5.47) 


where  T  —  duration  of  each  signal. 

The  general  optimum  receiver  schematic  depicted  in  Figure  5.2  is  valid  for 
this  case,  but  zero  channel  must  be  deleted. 


Inverse  probabilities  Py(u^)  •  •  •  Py(um)  of  all  m  possible  signals  /83 
are  compered  in  unit  CC  and  that  signal  u^,  the  inverse  probability  of 

which  turns  out  to  be  greatest,  is  selected. 


Formula  (5.4)  is  used  in  the  general  case  to  determine  probability  pyC\)> 

In  this  instance,  P(U|P  and  will  not  depend  on  k  and  formula  (5.4)  may  be  written 
in  the  following  form: 


P»(«»)-***1*. 

T 

5*“^rj^(/K 


(t)dt. 


(5.48) 


kj  —  constant  factor  rat  depending  on  number  k. 


We  will  find  correct  signal  discrimination  probability  pw  .  it  is  determined 
in  the  general  case  from  formula  (5.8} .  In  the  case  examined)  relationships  (5.46) 
occur;  in  addition)  (tie  to  the  identical  nature  of  all  non-zero  channels)  this 
condition  must  be  met 


P*,  (“t)  -•••  -P.*  («J- 


(5.49) 


Therefore,  formula  (5.8)  i8  simplified  and  takes  the  form 


P*§*b  "  ^K)- 


(5.50) 


Here,  P (u^)  is  the  probability  that,  when  signal  uk  is  present  at  input, 
the  receiver  rill  reproduce  that  very  signal  u^,  i.  e.,  magnitude  Py(uk)  will 
turn  out  to  be  the  great rat. 

Consequently,  computation  of  probability  pnv  from  formula  (5.50)  means 
assuming  that  signal  u^Ct)  is  at  input,  i.  e., 


y(0 -«»(/)+ «**(/). 


(5.51) 


Substituting  (5.51)  into  (5.48)  and 
will  obtain 


considering  orthogonality  condition  (5.45), 


0 

T 

*0 

^=£I“"(<)Ufc(/)d/+lr; 


(5.52) 


im  “  ~TT 


(t)dt. 


It  was  demonstrated  in  §  5.2  that  magnitude  n-  j  u,a(t)ui,(t)dl  has  a  normal  /84 

distribution  with  a  zero  average  value  and  dispersion  ^  [see  formulas  (5.17  and 
(5.21)]. 

Consequently,  in  the  case  examined  when  signals  have  identical  energy  Q, 
all  values  of  £^,  ^2»  •  •  •>  £m  (with  the  exception  of  £  k)  determined  from 
formulas  (5.52)  have  the  identical  law  of  distribution  with  a  zero  average  value 
and  dispersion 


(5.53) 


Magnitude  £k  also  has  a  normal  distribution  with  dispersion  <V  ,  but  its 
average  value  equals  not  zero,  but 


f  2Q 


(5.54) 


In  addition,  it  may  be  demonstrated  that,  due  to  the  orthogonality  of  signals 
u^t),  .  .  .,  um(t)»  random  magnitudes  £^,  £2,  •  •  •*  £m  atatisticaiiy  are 
independent. 


Corre^c  reproduction  will  take  place  if  probability  P  (u.)  turns  out  to  be 

£y  " 

k  turns  out  to  be  the  greatest  of  all  values 
£  1*  £2*  •  •  •*  £  m’  This  means  that,  if  parameter  £  ^  turns  out  to  equal 
certain  value  z  (regardless  of  which  one),  then  all  remaining  (m— 1)  magnitudes 

£  ^,  ^2*  *  *  •*  i>m  ^excePt  mus^  be  Tess  than  z* 

Consequently,  probability  Pnp«s  is  the  probability  that,  if  £  k  =  z  (where 
z  may  be  any  value),  then  all  remaining  (m— 1)  magnitudes  •  •  •» 

(except  £  k)  are  less  than  z. 

Since  magnitudes  £  statistically  are  independent  and  have 

an  identical  law  of  distribution,  then  the  probability  that  all  (m-1)  of  such 
magnitudes  simultaneously  will  turn  out  to  be  less  than  z  equals 


IP  (I,  <?)}"-'. 


(5.55) 


where  1  i  k. 


The  probability  that  magnitude  £  k  will  fall  within  the  limits  of  z  to 
z  +  dz  equals 


Wh{z)dz, 


(5.56) 


where  magnitude  £^  law  of  distribution. 

Magnitude  5  k  is  independent  relative  to  all  other  values  .  .  ., 
(except  £  k).  Therefore,  the  probability  that  these  relationships  are  satisfied 
simultaneously 


lt<z  (where  1  /  k) 


S*  =«  -  z+dz, 
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equals 


dP=[P(lt<z)\m-'  Wk  (z)  dz. 


Since  it  is  immaterial  what  value  z  is  for  correct  reproduction,  then 


/85 


J  [P(l,<r)lm"'  K\(z)dz.  (5.57) 

— om 

where  Wk(z)  —  probability  density  of  magnitude  £k,  while  P( 
probability  that  £ z  (where  1  t  k). 

In  the  case  examined,  considering  the  laws  of  distribution  of  magnitudes 
£»•  . .  5* . we  have 


02_  2Q 
AT.’ 


1  5 

J* 


(5.58) 


Substituting  (5.58)  into  (5.57),  we  obtain 


where 


dll. 


(5.59) 


Computations  using  this  formula  are  possible  only  by  means  of  numerical 
integration.  However,  if  the  error  probability  is  slight  (/\,iB<O.I),  it  is  possible 
to  simplify  expression  (5.59)  considerably,  having  employed  asymptotic  expansion 
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of  probability  integral  V(x).  Here,  (5.59)  is  reduced  to  an  approximate  formula 
[106]: 


^«2ln-i-  +  ln(m-l)-2,8-  (5.60) 

,¥0  '  OQI 

The  error  in  this  formula  does  not  exceed  1  db  if  <  0,|  . 

5.5  The  Case  of  an  m-Channel  Receiving  Device 

It  is  interesting  to  note  that  the  relationships  obtained  for  reception  of 
one  of  m  orthogonal  equiprobable  signals  with  identical  energies  are  valid  also 
for  the  m-channel  receiving  device  whose  block  diagram  is  depicted  in  Figure  5.6. 

This  receiving  device  comprises  m  identical  channels,  at  the  input  of  which 
are  active  independent  fixed  noise  voltages  uml(t),  umt(l), . umm{f)  (internal  /86 


Figure  5.6.  (a)  —  Channel  1;  (b)  —  Channel  2; 

(c)  —  Channel  kj  (d)  —  Channel  m. 

noise  of  these  channels,  for  example)  with  normal  laws  of  distribution,  zero  average 
values,  and  identical  dispersions  N  (where  N  s  «*,„*  ). 

Signal  uc(t),  the  shape  of  which  is  precisely  known,  may  be  present  at  input 
of  one  of  these  channels  with  equal  probability  (1/m).  The  job  of  the  receiving 


m 


Figure  5.7 

device  is  to  determine  the  input  of  the  exact  channel  at  which  a  signal  is  present 
during  a  given  observation  cycle  (0,  T).  The  receiver  has  input  voltage  realizations 
y^t),  72^)*  •  •  •»  ym(t)  *°  solve  this  problem  (Figure  5.7).  It  is  known  that 
one  of  these  realizations  will  comprise  the  sum  of  noise  “m*(0  and  signal  uc(t) , 
while  the  remaining  (m—  1)  realizations  will  comprise  only  noise.  The  task  is 
to  determine  precisely  which  of  these  realizations  will  comprise  the  signal. 

Here,  the  assumption  is  that  the  signal  will  last  the  entire  time 

T  -  x. 


Since  noise  is  stationary,  then  it  is  possible  to  place  realization 
y^(t),  .  .  .,  ym(t)  in  time,  rather  than  in  space,  as  depicted  in  Figure  5.8. 
Here,  the  task  boils  down  to  determining  in  which  of  m  sectors  of  composite  reali¬ 
zation  y(t)  the  signal  with  duration  T  is  located  or,  which  is  the  same  thing, 
determining  which  of  m  signals  nonoverlapping  in  time  (orthogonal)  is  in  /87 
this  realization. 


-w-rr  '  ♦,*  i 


'  *  *•/,*'  ‘  “«  '«  *  • 


Figure  5.8 


This  problem  was  examined  above  and  it  was  found  that  its  solution  requires 
comparison  of  inverse  probabilities  P  (u.  )  or,  which  is  the  same  thing,  comparison 

c  y  * 

of  magnitudes  and  to  select  the  greatest. 

Therefore,  in  an  m-channel  receiver  (Figure  5.6),  for  best  solution  of  the 
problem,  each  channel  also  must  comprise  a  computer  of  magnitude  £ k,  i.  e. , 
a  correlator  determining  £^  from  formula 


.  r 

5**^- (5.61) 
*  # 

where  yk(t)  —  oscillation  at  input  of  the  k-th  channel.  Comparison  of  magnitudes 
£  i,  £  2,  .  .  and  selection  of  the  greatest  one  occurs  in  unit  CC. 

As  a  result  of  this  selection,  the  decision  is  made  that  the  signal  is  present 
at  the  input  of  that  channel  in  which  magnitude  £ ^  turned  out  to  be  greatest. 

Since  signal  may  be  present  in  only  one  channel,  the  second  channel,  for 
instance,  then  y^(t)  will  comprise  the  signal-noise  sum  only  in  it: 


in  all  other  channels,  it  will  be 


yh(t)=umh(t)  (where  k  i  2) 


It  is  not  difficult  to  become  convinced  that,  here,  magnitudes 


• » 


£ 


£  1»  £  2* 


m 


have  exactly  the  same  form  as  in  the  case  of  m  orthogonal  signals 


[see  expression  (5.52)].  Therefore,  formulas  (5.57),  (5.59),  and  (5.60)  remain 
valid  if,  in  these  formulas,  m  is  understood  to  mean  the  number  of  receiver  channels, 
while  probability  Pa PM  is  understood  to  mean  the  probability  of  correct  designation 
of  the  channel  in  which  the  signal  is  present. 


CHAPTER  SIX  /88 

RECEPTION  OF  INDIVIDUAL  ANALOG  MESSAGE  VALUES 
6.1  General  Relationships 

It  is  assumed  during  reception  of  individual  analog  message  values  that  message 
x  is  an  analog  random  magnitude,  which  is  constant  during  an  observation  cycle 
(0,  T),  and  changes  from  one  interval  to  another  with  respect  to  random  laws  with 
a  priori  probability  distribution  P(x).  The  limits  of  possible  message  changes, 
jrMaR  and  jrMam  ,  are  assumed  to  be  known. 

Kotel'nikov  for  convenience  assumes  that 

■^unn  ~  1 I*  (6.1) 

Here,  in  accordance  with  a  normality  condition,  the  following  must  be  true 

jP(x)dx-l.  («.2) 

A  receiver  computes  inverse  probability  distribution  P^(x)  determined  from 
formula  (4.11)  and  supplies  that  message  value  at  which  function  Py(x)  is 
maximum. 
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For  simplicity,  it  is  assumed  that  all  message  values  are  equiprobable, 


1.  e. 


P(x)  =  const. 


Here,  expression  (4.11)  takes  the  form 


Pv  (•*)=*»  exp  J  ( y  ( 0 —«*  (01* dt  j . 


(6.3) 


(6.4) 


where  constant  ^  will  be  found  from  normality  condition  (6.2). 

The  assumption  that  all  message  values  are  equally  probable  considerably 
simplifies  the  analysis  and  is  proved,  in  addition,  by  the  following  circumstances: 


1.  In  several  cases,  all  message  values  actually  are  equiprobable. 


|  2.  Often  there  is  absolutely  no  information  available  on  the  law  of  distribution 

P(x).  Here,  the  assumption  concerning  equal  probability  of  all  messages  is  usually 
most  natural. 


In  a  number  of  cases,  equidimensional  distribution  P(x)  is  the  most  favorable, 
i.  e. ,  requires  the  greatest  signal  energy  during  reception  (see  Chapter  17  for 
more  details  on  this).  Consequently,  assuming  distribution  P(x)  to  be  equidi¬ 
mensional,  we  thereby  are  investigating  the  worst  case. 


3.  Given  the  requirement  for  highly-precise  message  reproduction,  i.  e., 
given  a  high  signal- to-noise  ratio,  the  structure  of  the  optimum  receiver  /89 
and  its  sensitivity  essentially  will  not  depend  on  the  type  of  a  priori  distribution 
P(x)  (proof  of  this  point  is  provided  in  Chapter  19).  Therefore,  given  a  high 
signal-to-noise  ratio,  it  is  completely  permissible  for  simplified  analysis  to 
assume  that  distribution  P(x)  is  equidimensional,  even  if  it  is  known  that  this 
s  distribution  may  be  significantly  irregulars 

i 

K  Since  Kotel'nikov  succeeded  in  conducting  the  most-detailed  analysis  of  analog 

1 


m 
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j&jW  •fl-'A-*?---'  At.*?. 


massage  reception  only  for  great  signal-to-noise  ratio  values,  then  his  assumption 
(6.3)  is  completely  valid. 


It  follows  from  (6.4)  that  most-probable  message  value  xvn  must  satisfy  the 
equation* 


m,.  -«• 


(6.5) 


where 


E  (*)  — y  j  \y  (/)  (ON*  -  ((.V (0  (0|V 


(6.6) 


In  expanded  form,  equation  (6.5)  has  the  form 


(uw-s.«Fs®ut-a 


(6.7) 


Next,  the  form  of  function  Py(x)  approximating  its  maximum  will  be  found,  i.  e., 
approximating  most-probable  value 


For  x  approximating  xfmf  it  is  possible  to 


(6.8) 


Substituting  this  expression  into  formula  (6.4)  and  considering  relationship 
(6.7),  we  obtain 


(6.9) 


*It  is  assumed  that  function 


£  (x)  is 


differentiated  with  respect  to  x. 


:-v- 
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where 


i, _ L t\dJ!*l!l2  \ 

at.  I!  dx  KJr  (6.10) 


while  kj  —  constant  not  depending  on  x  and  determined  from  normality  condition 


(6.11) 


If  noise  intensity  Nq  is  sufficiently  slight  (i.  e. ,  the  signal-to-noise 
ratio  is  great),  then  the  indicator  of  power  in  expressions  (6.4)  and  (6.9),  outside 
the  area  in  which  equality  (6.8)  is  valid,  will  become  so  large  relative  /9Q 
to  absolute  magnitude  that  it  is  possible  to  disregard  magnitude  Py(x)  outside 
this  area.  Here,  one  may  consider  that  distribution  (6.9)  is  valid  in  the  area 
of  all  possible  values  of  x  (and  not  only  approximating  -*,»)  and,  in  expression 
(6.11),  it  is  possible  to  change  the  integration  limits  by  —  oo  and  Oo  .  Then, 
from  formula  (6.11),  we  obtain 


Consequently,  given  a  high  signal-to-noise  ratio,  distribution  Py(x)  has 
the  form  of  a  gaussian  curve 

P¥(*)~  ]/  £e-6u.“'r»,\  (6.12) 

Signal  reproduction  quality  is  characterized  by  error 

8  =  Y— *= (6.13) 

Since  x  here  is  understood  to  mean  a  message  normalized  in  such  a  way  that 
its  maximum  magnitude  equals  unity,  then  error  (5  also  turns  out  to  be  normalized 
accordingly. 


The  mean  aqua re  of  the  error  equals 


6’  =(x,  „—*)*=  M  (*„  „—*)*.  (6.14) 

Two  random  magnitudes,  x  and  ,  are  included  in  this  expression;  here, 
magnitude  xUn  is  random  because  it  will  depend  on  the  type  of  realization  y(t) 
of  the  signal-noise  sum  (see  Figure  4.2). 

Initially,  we  will  find  the  conditional  mean  square  of  the  error,  determined 
by  relationship 


*■  (*» a  *)***  j x)* Pg (x) dx.  (6.15) 

It  is  evident  from  (6.15)  that  will  be  found  for  given  realization  y, 

i.  e.,  is  conditional  expected  value  of  the  square  of  the  error. 

Introduction  of  magnitude  is  convenient  because  its  determination  requires 
only  knowing  conditional  distribution  Py(x),  which  already  had  been  found  above. 

For  computation  of  unconditional  mean  square  8*  ,  magnitude  f?  must,  in 
turn,  be  averaged  using  multidimensional  distribution  P(y)  of  random  function 
y(t).  This  second  averaging  in  the  general  case  encounters  great  difficulties. 
However,  as  Kotel'nikov  demonstrated,  given  a  sufficiently -great  signal- to-no is e 
ratio,  magnitude  essentially  will  not  depend  on  y  and,  consequently,  it  is 
possible  to  assume 


T*-??.  (6.16) 

Actually,  for  slight  noise,  from  (6.12)  and  (6.15),  we  obtain  191 


where  b  is  determined  from  formula  (6.10),  i.  e.,  it  is  a  magnitude  not  depending 


on  *vn  ,  and,  consequently,  on  y.  Therefore,  given  slight  noise,  it  is  possible 
to  assume 


(6.17) 


It  follows  from  formulas  (6.10)  and  (6.17)  that  mean  square  error  decreases 

with  a  rise  in  magnitude  aic~'  Jr  *  ©*f  in  the  mean  (for  time  T)  square 

of  the  partial  derivative  of  signal  u^(t)  with  respect  to  message  x.  This  result 
is  fully  understandable  since  the  greater  this  derivative,  the  greater  the  change 
in  oscillations  u  (t)  shape  caused  by  slight  change  Ax  in  message  x.  Therefore, 
the  greater  the  noise  magnitude  must  be  for  the  receiver  to  err  in  detecting  this 
true  message  change. 

Consequently,  the  greater  magnitude  ^  ,  the  higher  the  receiver 

noise  immunity. 


It  follows  from  relationships  (6.12)  and  (6.13)  that,  given  slight  noise. 


p,  («>  - 

Since  magnitude  b  will  not  depend  on  y  in  this  case,  then  we  obtain 


P(*)-P,(«)-|/  £e-*6’ 


i  c-d»/»y» 
I  2x8* 


(6.18) 


i.  e. ,  error  0  has  a  normal  law  of  distribution  P(  0  )  with  a  dispersion  equalling 
3*  . 


Therefore,  given  slight  noise,  the  probability  that  error  0  will  exceed 
magnitude  €  with  respect  to  absolute  value  equals 

where  V(x)  —  probability  integral  determined  from  formula  (5.28a). 
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The  next  important  conclusion  that  Kotel'nikov  obtained  for  a  case  of  slight 
noise  is  that  a  receiver  operating  on  the  maximum  inverse  probability  principle 
provides  minimum  possible  mean  square  error.  Proof  of  this  assumption  will  not 
be  provided  here  since  it  may  be  obtained  as  a  partial  case  of  more  general  theorems 
formulated  in  Chapter  19. 

Thus,  Kotel'nikov  obtained  the  following  important  results  for  slight  /92 
noise,  i.  e.,  for  a  large  signal-to-noise  ratio. 

1.  A  receiver  operating  on  the  maximum  inverse  probability  density  principle 

provides  the  minimally-possible  mean  square  error.  This  error  is  determined 

from  formulas  (6.17)  and  (6.10). 

2.  error  (5  is  subordinate  to  a  normal  law  of  distribution  with  a  zero 
average  value  and  dispersion  8*  (formula  (6.18)]. 


Kotel'nikov  succeeded  in  finding  only  the  lower  limit  of  the  probability 
error  for  a  random  signal-to-noise  ratio,  determined  by  the  following  inequality, 


P(  l«|>e)>  f  V(*t)dx9, 

— <T— «) 


(6.20) 


where 


«i*  — 5J-  j  1“  (*«  +  «»  0— «(*.—«.  t)]*dt; 


u(x9  ±  e,  0  —  function  ux( t)  values  where  x  =**  x9  ±  e 


In  a  number  of  cases,  ai  will  not  depend  on  xq.  Here,  formula  (6.20)  is 
simplified  and  takes  the  form 


P(  |8|>«)>2(l-«)  ?(«,). 


(6.20a) 


Based  on  the  resultant  general  relationships,  Kotel'nikov  compered  the  various 


132 


types  of  modulation  and  reception  methods.  The  main  results  of  this  analysis 
will  be  presented  in  the  next  section. 

6.2  Comparison  of  Different  Modulation  Types  and  Reception  Methods 


In  the  case  of  amplitude  modulation  (AM) 


“*(0  =  (1  +mx)B(t) 


(6.21) 


and 

dx 


Therefore,  formula  (6.10)  takes  the  form 


W.22) 

0 

T 

where  Q,  **  I  B*{l)dt  is  the  energy  of  carrier  oscillation  B(t),  i.  e.,  signal  energy 
o 

in  the  absence  of  modulation. 

Substituting  relationship  (6.22)  into  formula  (6.17),  we  obtain  /93 


([m-f  im*- 


(6.25 


For  several  signal  types,  the  result  of  averaging  using  formula  (6.25b)  will 
not  depend  on  moment  t^,  when  the  observation  began  and  it  is  possible  to  select 
any  magnitude  t^,  to  consider,  for  example,  that  t^  =  0  (as  was  the  case  above) 
or  to  assume  that  t^  s-T/2.  However,  in  a  case  of  FM,  magnitude  t^  significantly 
impacts  upon  averaging  results. 

Actually,  from  relationships  (6.24),  (6.25a),  and  (6.25b)  (where 
U  T  1) ,  we  obtain 


8i= _ _ 

2QQ*  (T*  4- 3/j7" 


(6.26 


This  expression  where  t^  s-T/2  has  a  maximum  equalling 


gi  _  6'v* 

qq«7-*  • 

Where  t^>«  — T/2,  error  decreases  monotonically  when  t^  increases. 


(6.26 


Consequently,  removing  moment  t^  when  observation  begins,  it  is  possible 
in  principle  to  make  a  message  reproduction  error  as  slight  as  desired.  This 
result  is  explained  as  follows. 


The  following  signals  correspond  to  two  message  x  values  x^  and  X2 


«*i  (0  -  U9  /2cos  ((a,  4-O.r,)  t  +  <p„] 


and 


«,,(/)=*(/„  J  2  cos  [  ((o#  4-  Q.tj)  /  4-  <p0  ] . 


The  difference  in  their  phases  equals 

Aip  =  Q  (xt — Jtj)  t  =  \x£lt. 


(6.26c) 


where  Ax  = 

Consequently,  signals  u^(t)  and  u^t)  differ,  not  only  with  respect  /94 
to  frequency,  but  also  with  respect  to  rf  occupation  phase.  Here,  it  is  evident 
from  expression  (6.26c)  that  if  one  begins  observation  of  oscillations  u^(t) 
and  Ux2(t)  over  sufficiently -great  time  t  following  a  frequency  change,  then 
even  slight  message  change  Ax  will  cause  a  very  great  signal  phase  change. 

Consequently,  signal  phase  modulation  (PM]  and  FM  occur,  which  makes  it 
possible  clearly  to  detect  even  the  slightest  message  x  changes  if  selected  moment 
t,  when  the  observation  begins,  is  sufficiently  late. 

However,  realization  of  very  great  noise  immunity  occurring  during  great 
t^  changes  is  hindered  by  the  following  circumstances: 

1.  Since  increased  noise  immunity  is  achieved  due  to  phase  modulation,  initial 
signal  rf  occupation  phase  must  be  precisely  known  at  the  point  of  reception, 
which  does  not  occur  in  a  majority  of  cases. 


2.  The  lag  in  receiver  message  reproduction  rises  with  a  t^  increase. 

3.  Formula  (6.26a)  is  valid  only  given  a  great  signal-to-noise  ratio  and, 
as  demonstrated  in  the  Kotol'nikov  analysis,  the  greater  the  t^,  the  greater  the 
signal-to-noise  ratio  must  be  in  order  for  the  advantage  provided  by  formula  (6.26a) 
actually  to  occur. 

It  is  shown  in  §  13.3  that,  in  those  cases  when  phase  is  unk  .^wn  at 
the  point  of  reception,  error  6r  is  determined  by  formula  (6.26b).  Th^efore, 
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this  case  precisely  is  of  the  greatest  practical  value  and,  in  future,  we  will 
assume  that,  given  signal  FM,  message  reproduction  error  is  determined  by  formula 
(6.26b). 

It  follows  from  comparing  formulas  (6.26b)  and  (6.23)  that  FM  provides  a 
significant  advantage  compared  to  AM  only  where 

£27'»1.  (6.27) 

But,  where  QT  >  1  ,  the  frequency  band  occupied  by  possible  (anticipated) 
signals  turns  out  to  be  considerably  broader  in  the  case  of  FM  than  in  the  case 
of  AM. 

Actually,  let  the  carrier  oscillation  during  the  entire  observation  cycle 
have  the  form  of  a  sinusoid  with  frequency  /e  (Figure  6.1a).  The  spectrum  of 


Figure  6.1 


such  an  oscillation  is  depicted  in  Figure  6.1b.  The  basic  part  of  the  spectrum 
has  width  ^e>  equalling 


/70  *  —  (6.28) 

e  T 

If  such  a  signal  is  amplitude  modulated  by  message  x,  then  the  shape  of  the 
spectrum  remains  unchanged  for  all  values  of  x.  Only  the  scale  with  respect  to 
the  Y-axis  of  curve  E(f)  changes  [since,  during  interval  (0,  T),  message  x  /95 
is  constant  and  changes  only  from  one  observation  cycle  to  another].  Therefore, 
optimum  receiver  bandwidth  must  approximately  equal  /7e  : 

/7«/7e»-I  (6.29) 

In  the  case  of  FM,  the  sinusoid  frequency  change  ranges  from  ft'  acf*’~'£~ 

to  //  “/«  +  ^  .  Therefore,  given  extreme  message  x  values  (x  =  -1  and  x  =  1) , 
anticipated  signal  spectra  have  the  form  depicted  in  Figure  6.1c  and  the  optimum 
receiver  must  have  band 


and,  where  Q7*  >  1  ,  we  obtain 

IV  OT 

Thus,  the  advantage  obtained  during  FM  compared  with  AM  results  only  due  to  receiver 
bandwidth  expansion.  But,  noise  intensity  increases  when  the  bandwidth  is  expended 
and  formula  (6.26b)  no  longer  is  valid  for  great  noise. 
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Kotel ' nikov ' s  approximate  analysis  of  a  case  of  great  noise  using  inequality 
(6.20a)  demonstrated  that,  given  great  noise,  the  advantage  obtained  with  FM  over 
AM  decreases,  while  the  decrease  turns  out  to  be  more  significant,  the  greater 
the  Consequently,  the  nature  of  the  advantages  FM  has  over  AM,  in  the  case 

of  optimum  receivers,  is  identical  to  that  of  known  extant  receivers;  FM  makes 
it  possible  to  increase  noise  immunity  only  through  expansion  of  receiver  bandwidth 
and  this  may  be  realized  only  given  a  sufficiently-great  signs 1-to-noise  ratio, 
i.  e.,  only  when  high  message  reproduction  requirements  are  levied. 

In  the  case  of  pulse-position  modulation  (VIM)  [PPM],  the  high-frequency 
pulse  envelope  displaces  in  time,  depending  upon  transmitted  message  x,  without  /96 
changing  its  shape,  i.  e., 


ux (0  —  x ]  cos (m, t  -f- <p ). 


(6.32) 


Since  observation  time  T  must  encompass  all  possible  pulse  positions  during 
changes  of  x  from  -1  to  1,  magnitude  must  meet  condition  (Figure  6.2) 


*.  +  t 


(6.33) 


where  t*  —  pulse  duration. 


Figure  6.2 


The  following  formula  is  obtained  from  relationships  (6.10),  (6.17),  (6.32) 
and  (6.33a)  for  determination  of  mean  square  error  for  a  great  signal-to-noise 
ratio: 


It  was  assumed  during  derivation  of  this  formula  that  envelope  Uffl(t)  is  a 
slow  time  function  compared  with  cos  (o»#/  +  <p)  . 


Figure  6.3 


If  the  signal  has  a 
envelope  is  described  by 


rectangular  frequency  spectrum  (Figure  6.3),  then  its 
the  equation 


Substituting  this  expression  into  formula  (6.34)  and  assuming 

ar»i, 


(6.3 


(6.3 


we  obtain 


Signal  relative  energy  Q  in  this  case  equals 


/97 


(6.38) 

Therefore,  it  is  possible  to  write  formula  (6.37)  in  the  following  form: 

8»- —**>. 

(6.39) 

Considering  (6.33a),  we  obtain 

Q(or)*'  (6.40) 

Comparison  of  relationships  (6.26b)  and  (6.40)  demonstrates  that,  for  the 
assumptions  made,  noise  immunity  is  identical  for  PPM  and  FM.  In  a  case  of  great 
noise,  PPM  and  FM  properties  also  approximately  coincide:  the  advantage  provided 
by  PPM  compared  to  AM  (or  pulse-amplitude  modulation  AIM  [PAM])  decreases  and 
is  more  significant,  the  greater  the  Q7*. 

Comparison  of  the  noise  immunity  of  optimum  and  real  receivers,  given  a  high 
signal-to-noise  ratio,  will  lead  to  the  following  results: 

1.  Given  amplitude  modulation,  a  receiver  with  a  conventional  amplitude 
detector  and  properly-selected  bandwidths  ahead  of  and  beyond  the  detector  provides 
noise  immunity  approximating  the  optimum. 

2.  Given  pulse-position  modulation,  a  real  receiver  also  may  provide  noise 
immunity  approximating  the  potential  if  the  bandwidth  selected  in  it  is  optimum 
and  pulse  envelope  position  with  respect  to  time  is  computed  with  consideration 
of  the  bias  both  of  the  leading  and  of  the  trailing  edge  of  this  envelope  [1]. 

6.3  Geometric  Interpretation  of  Results 

Results  obtained  become  clearer  when  geometric  models  in  n-dimensional  space 
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are  used.  V.  A.  Kotel'nikov  used  n-dimensional  geometry  for  the  first  time  in 
the  theory  of  signal  reception  [1],  and  it  was  used  by  a  number  of  other  authors, 
including  A.  A.  Kharkov ich  [4]. 


Geometric  interpretation  is  founded  on  use  of  expansion  (1.12),  which,  in 
this  case,  it  is  convenient  to  reduce  to  the  following  form: 


where 


*-i 

(6.41a) 

/98 

fk  t*'  (0  ■  l/2/,7\|>*  (/). 

(6.41b) 

l+k; 

0 

(6.41c) 

o  t 


8 


(6.41d) 


Magnitudes  f^'  and  differ  from  f^  and  ^  by  constant  factors  and  turn 

out  to  be  more  convenient  because  the  mean  square  of  magnitude  ^/k'  equals  unity, 
while  the  sum  of  the  squares  of  f^'  ordinates  equals  oscillation  f(t)  energy  Q. 


Functions  ^^'(t)  are  standard  (known)  unitary  orthogonal  time  functions. 
Therefore,  it  follows  from  expression  (6.41a)  that  any  time  function  (with  a 
restricted  spectrum)  completely  is  determined  by  n  of  its  own  values  f^', 

.  .  .,  fn',  which  may  be  considered  coordinates  of  function  f(t)  in  some  imaginary 
n-dimen8ional  space.  Here,  functions  ^k'(t)  may  be  considered  unitary  directional 
orthogonal  functions,  i.  e. ,  unit  vectors. 


Space  having  more  than  three  dimensions  cannot  be  represented  on  a  drawing, 
but  it  is  possible  to  represent  it  qualitatively  (conditionally)  as  depicted  in 


Figure  6.4  Figure  6.5 

Figure  6.4.  In  the  figure,  ±  *  ^2'*  *  *  *' V^n'  are  unitary  vectors  of  the 
corresponding  coordinate  axes  (unit  vectors),  f^',  f_' ,  .  .  .,  f^'  are  the  coordinates 
of  point  Ap  i.  e.,  ends  of  vector  r^  or  of  the  coordinates  along  the  axes  forming 
this  vector. 

Consequently,  point  (vector  r^)  corresponds  to  some  time  function  f^(t) . 
Another  time  function,  f2(t),  has  another  combination  of  coordinates  f^',  f^', 

.  .  .,  f  •  and,  consequently,  some  other  point  A^  (vector  r2)  of  n-dimensional 
space  corresponds  to  it. 

The  length  of  vector'?  in  an  orthogonal  coordinate  system  is  determined  from 
the  following  expression: 


Yri*+ft'  +  -+W  -  (6.42) 

i.  e. ,  the  length  of  vector  r,  which  corresponds  to  oscillation  f(t),  equals  the 
square  root  of  the  energy  of  this  oscillation. 


We  will  examine  reception  of  discrete  messages  first.  In  this  case,  the 
signal  may  have  one  of  m  non-zero  values  u^(t),  u^(t),  .  .  .,  um(t)  with  energies 


Qp  Qjf  .  .  .,  Qm.  Some  points  u^f  U2#  •  •  *>  um  correspond  to  these  possible 
signals  in  n-dimensionsl  space  (Figure  6.5). 

For  example,  let  signal  U2(t)  be  present  at  receiver  input  during  given  /99 
observation  cycle  (0,  T).  Due  to  presence  of  noise,  total  oscillation  y(t)  at 
receiver  input  equals 

y{t)  =  u,(t)  +  um{t). 


Since  oscillation  y(t)  differs  from  U2(t)  by  the  magnitude  of  noise  voltage 
um(/)  ,  then  corresponding  point  y  in  n-dimensional  space  (Figure  6.5)  is  separated 

from  point  by  distance 


Ar-y/'Q, 


m* 


(6.43) 


where  Qm  —  noise  energy  (conditional),  i.  e., 


T 

Pm  =“1  «*  (()  dt*s  1  V 

0  a/. 


(6.44) 


Voltage  «m  is  a  random  time  function.  Therefore,  vector  Ar  magnitude  and 
direction  also  are  random. 


It  follows  from  (6.44)  that 


3  yn*  =  — 

Vm  2/i  Z  »*  2/. 


'ui* 


/. 


For  white  noise 


■*  r. 


and 
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(6.45) 


DaIaI  *■ 


Consequently,  in  the  esse  of  white  noise 


(6.46) 

where  Are  *  —  mean  square  value  of  distance  Ar  from  total  oscillation  y(t)  /100 

to  true  signal  u^(t)  [in  this  case,  1^(0  ]. 

It  follows  from  formula  (1.25}  that,  for  white  noise,  the  probability  of 
a  given  realization  ura(/)  monotonically  rises  with  a  decrease  in  energy  Q m  of 
this  realization  and  has  the  greatest  value  where  <?m  =  0. 

Thus,  in  n-dimensional  space,  oscillation  y(t)  differs  from  true  signal  uk(t) 
by  random  vector  Ar.  In  the  case  of  normal  white  noise,  the  most-probable  value 
of  distance  Ar  equals  zero,  while  the  mean  square  value  is  determined  from  formula 

(6.46).  The  greater  distance  Ar  is,  the  less  its  probability. 

The  task  of  a  receiver  possessing  oscillation  y(t)  is  establishing  exactly 
which  of  the  m  possible  signals  is  present  at  input.  A  receiver  operating  on 
the  maximum  inverse  probability  principle  each  time  selects  that  signal  for  which 
inverse  probability  Py(x)  is  maximum.  If  all  signals  are  equally  probable,  then, 
as  follows  from  (4.11),  magnitude  Py(x)  is  maximum  if  the  following  expression 
is  minimum 


(6.47) 

o 

Consequently,  given  equiprobable  signals,  the  receiver  must  each  time  select 
that  signal  ux(t)  for  which  magnitude  £  (x)  is  minimum.  But,  it  follows  from 

(6.47)  that  £  (x)  is  the  energy  (conditional)  of  the  difference  of  oscillations 
y(t)  and  ux(t) ;  the  squares  of  distances  in  n-dimensional  space  correspond  to 
energies.  Therefore,  given  equiprobable  signals,  the  optimum  receiver  each  time 
must  select  the  signal  located  closest  in  n-dimensional  space  to  total  oscillation 
y(t). 


Such  a  result  is  fully  understandable.  Actually,  it  was  demonstrated  above 
that  the  shorter  the  distance  Ar  from  oscillation  y(t)  to  true  signal  u  (t),  the 


more  probable  it  is.  Therefore,  if  all  anticipated  signals  are  equally  probable, 
that  signal  that  is  the  least  distance  from  oscillation  y(t)  should  be  selected 
as  the  true  signal  to  obtain  minimum  error  probability. 

Now,  we  will  examine  reception  of  individual  analog  message  values.  In  this 
case,  message  x  may  have  any  value  ranging  from  -1  to  1;  therefore,  a  continuous 


Figure  6.6  Figure  6.7 


multidimensional  line,  called  the  signal  line  (Figure  6.6),  rather  than  a  collection 
of  individual  points,  represents  the  aggregate  of  anticipated  signals  in 
n-dimensional  space. 

Point  y,  corresponding  to  total  oscillation  y(t),  is  separated  from  point  x, 
corresponding  to  the  signal  carrying  true  message  (x),  by  a  distance  equalling 

✓5T  • 

If,  as  Kotel'nikov  assumed,  all  message  x  values  are  equally  probable,  then 
the  receiver  selects  as  the  true  value  that  message  x  value  to  which  minimum  magnitude 
£(x)  corresponds  [formula  (6.47)],  i.  e.,  that  point  on  the  signal  line,  /101 
located  the  minimum  distance  from  point  y  (point  *»«  in  Figure  6.6).  Here,  a 
certain  error  is  made 

It  is  evident  from  Figure  6.6  that,  for  a  given  noise  energy,  the  relative 
error  will  be  less,  the  greater  the  entire  signal  line  length.  Therefore,  one 


i 


i 


4 

i 

i . 

l:. 

K 


t 


1 


1 


1 


should  strive  to  increase  signal  line  length  to  increase  noise  immunity. 

For  AM,  only  energy  Qx  of  oscillation  u^(t)  will  depend  on  magnitude  of  x, 
while  the  shape  of  this  oscillation  remains  unchanged.  Therefore,  the  corresponding 
signal  line  has  the  form  of  an  n-dimensional  straight  line,  as  depicted  conditionally 
in  Figure  6.7.  For  modulation  capabability  equalling  100%,  signal  energy  changes 
from  zero  to  4Qq,  where  Qq  —  carrier  oscillation  energy.  It  is  possible  here 
to  increase  signal  line  length  only  by  increasing  signal  energy  Qq. 

Consequently,  for  AM,  it  is  possible  to  increase  noise  immunity  (decrease 
mean  square  error  6*  )  only  by  increasing  energy  Q  (for  given  noise  intensity), 
which  is  fully  compatible  with  relationship  (6.23). 

For  FM,  signal  energy  does  not  change  in  the  modulation  process.  Consequently, 
all  signal  line  points  must  be  located  from  the  origin  at  identical  distance 
YQ  ,  i.  e.,  the  signal  line  must  be  located  on  the  surface  of  a  hypersphere 


Figure  6.8 

(in  n-dimensional  space,  a  hypersphere  is  the  analog  of  a  sphere  of  three-dimensional 
space).  Therefore,  in  the  case  of  FM,  it  is  possible  to  draw  the  signal  line 
as  shown  in  Figure  6.8,  i.  e.,  in  the  form  of  some  line  on  the  surface  of  a  sphere 
with  a  radius  equalling  /Q  ,  where  Q  —  signal  energy. 

It  is  evident  from  comparing  Figures  6.7  and  6.8  that,  in  the  case  of  FM, 
signal  line  length  may  be  much  greater  then  for  AM  (given  identical  signal  /102 
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energy)  and,  thanks  to  this,  given  slight  noise,  the  mean  square  error  will  be 
significantly  less.  However,  in  the  case  of  great  noise,  the  advantage  will 
disappear  due  to  onset  of  so-called  anomalous  errors. 

Actually,  if  noise  energy  Qm  is  so  great  that  point  y  is  thrown  aside  from 
point  x,  corresponding  to  the  true  signal  (and,  thereby,  to  the  true  message), 
to  a  segment  exceeding  one-half  the  distance  between  adjacent  turns  of  the  spiral, 
then  point  X2»  located  on  the  adjacent  turn  of  the  spiral,  rather  than  point  x^ 
will  be  at  the  shortest  distance  from  y.  In  this  case,  the  receiver  will  reproduce 
message  i*  ®*»  make  an  error  corresponding  to  a  complete  turn  of  the  spiral, 

rather  than  to  a  small  portion  of  a  turn.  Such  an  abnormally-large  error  is  called 
an  anomalous  error.  It  is  evident  that  a  threshold  for  onset  of  such  an  error 
exists— a  slight  signal-to-noise  increase  may  lead  to  the  transition  from  a  normal 
error  to  an  anomalous  error. 
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For  FM,  an  error  decrease  may  be  attained  both  through  increasing  signal 
energy  Q  and  through  increasing  the  number  of  dimensions  n.  Actually,  the  surface 
of  the  hypersphere  rises  with  an  increase  in  Q  and,  consequently,  so  does  the 
length  of  the  signal  line  located  on  it. 

We  will  examine  the  simplest  case,  depicted  in  Figure  6.9,  to  explain  the 
impact  of  the  number  of  dimensions.  Here,  n  equals  unity,  two,  and  three.  It 


h 


Figure  6.9 


is  evident  from  this  figure  that,  given  constant  signal  energy  Q,  i.  e.,  given 
constant  magnitude  r9  =  /{T  ,  possible  signal  line  length  also  rises  with  a  rise 
in  n.  But,  n  s  2 fwT  ;  consequently,  in  the  case  of  FM,  the  error  must  decrease, 
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both  with  a  rise  in  signal  energy  Q  and  with  a  rise  in  the  number  of  dimensions, 
i.  e.,  with  a  rise  in  the  product  of  band  ft  during  observation  cycle  T.  This 
result  fully  is  compatible  with  formula  (6.26b).  Mean  noise  energy  [see  formula 
(6.45)]  also  rises  simultaneously  with  a  rise  in  the  number  of  dimensions  2ftT  . 
Therefore,  too  great  an  increase  in  product  f»T  will  lead  to  onset  of  anomalous 
errors  and,  consequently,  to  disappearance  of  the  advantage  FM  supplies  over  AM. 

Thus,  all  conclusions  obtained  in  §  6.2  analytically  have  clear  geometric 
confirmation. 

In  the  case  of  PPM  and  other  modulation  methods  in  which  oscillation  energy 
remains  unchanged  in  the  modulation  process,  Figure  6.8  remains  qualitatively 
valid  and  analogous  results  occur — where  ^  ,  a  significant  advantage  accrues 

compared  with  AM,  but,  here,  a  signal-to-noise  ratio  threshold  value  exists.  /103 
The  probability  of  anomalous  errors  rises  radically  when  this  threshold  is  exceeded. 

It  is  necessary  to  increase  signal  energy  or  the  number  of  dimensions  2 f„T  t 
i.  e. ,  bandwidth  or  observation  cycle,  to  decrease  mean  square  error  Y 6*  in  the 
modulation  types  examined  above. 

Kotel'nikov  examined  the  possibility  of  decreasing  error  P  ,  given  constant 
signal  energies  and  number  of  dimensions.  If  signal  energy  and  number  of  dimensions 
remain  unchanged,  then  the  hypersurface  on  which  the  signal  line  is  located  also 
remains  unchanged.  Therefore,  the  signal  line  length  increase  required  to  decrease 
error  may  be  achieved  only  by  increasing  the  sinuosity  of  this  line,  i.  e.  for 
example,  by  more  dense  placer,;ent  of  the  spiral  turns.  Evidently,  great  signal 
line  sinuosity  corresponds  to  a  more  complex  type  of  signal  modulation,  such  as 
double  modulation.  Therefore,  when  Q  and  f*T  are  unchanged,  it  is  possible  to 
achieve  a  decrease  in  error  only  through  use  of  more  complex  modulation  types. 

Since  adjacent  turns  of  the  spiral  (signal  line)  turn  out  here  to  be  closer  to 
each  other,  then  anomalous  errors  will  arise  during  a  lesser  signal-to-noise  ratio 
than  is  the  case  for  simple  modulation. 


Consequently,  it  is  possible  to  reduce  error  when  Q  and  f*T  remain  unchanged 
only  in  the  event  of  ,  very  slight  signal-to-noise  ratio.  These  qualitative  results 
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stemming  from  representations  in  n-dimensional  space  are  completely  confirmed 
quantitatively. 

Thus,  for  example,  it  has  been  demonstrated  that  if  one  simultaneously  uses 
carrier  FM  and  envelope  PM  with  respect  to  the  law 

ux  (0  “  U0  ( I  4-  cos  (Q0  t+ax)  j  •  cos  (u>„  -f  U*)  t, 

then  the  increase  in  parameter  a  makes  it  possible  to  decrease  error  ft*  without 
a  change  in  signal  energy  and  spectrum  width.  However,  the  greater  parameter 
a,  given  less  signal-to-noise  ratio,  then  anomalous  errors  arise  and,  consequently, 
the  acK/antage  obtained  with  such  complex  modulation  disappears. 


a 


CHAPTER  SEVEN 


RECEPTION  OF  OSCILLATIONS 


I 


7.1  Basic  Relationships 

The  term  reception  of  oscillations  is  understood  to  mean  reception  of  messages 
changing  so  rapidly  that,  during  observation  cycle  (0,  T),  they  must  be  considered 
time  function  x(t).  It  is  assumed  for  simplicity  that  function  x(t)  is  constrained 
by  limits  of  +  1. 

The  receiver  computes  inverse  probability  distribution  Py(x)  determined  from 
formula  (A. 11)  and  supplies  that  message  value  *r«(0  at  which  function  Py(x)  is 
maximum: 


?<*)-**«  (0- 


(7.1) 


It  is  assumed  that  all  possible  realizations  of  the  random  function  are  equally 
probable,  i.  e., 

P(*)  —const  where  1*1  <1; 

P (x)  —  0  where  |x|>l. 
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Here,  the  inverse  probability  density  maximum  corresponds  to  the  minimum 
with  respect  to  x  of  magnitude 


(7.2) 


Message  reproduction  error  equals 


(7.3) 


i.  e. , 


?(/)«*  </)  +  «(<). 


where  (t)  —  oscillation  at  receiver  output. 


(7.4) 


It  is  assumed  that,  when  there  is  no  noise,  the  receiver  does  not  supply 
distortions,  i.  e. , 

7(0 -*(«); 

consequently,  function  (5  (t)  also  may  be  considered  a  component  of  output  oscillation 
7  (t>  caused  by  noise  action,  i.  e.,  a  noise  oscillation  at  receiver  output. 

Here,  since  function  x(t)  is  normalized  so  that  it  is  dimensionless  and  included 
in  the  range  of  +  l,  then  oscillations  7m  and  (5  (t)  also  are  normalized. 

The  real  ( non-no rmalized)  oscillation  at  receiver  output  may  be  written  in 
the  form 


“»« (0  (0  +  u»* (0. 


(7.5^ 


where  ua(0  —  usable  voltage  at  receiver  output  obtained  in  the  absence  of  /105 
noise,  while  u«e(/)  —  additional  fluctuating  component  created  by  noise  action. 


**  /,  *  _  • *  •.*•»’»  • 


Consequently , 
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where  k  —  some  proportionality  coefficient. 


(7.6) 


Let  Uua  be  the  maximum  value  of  output  oscillation  “o(0i  corresponding  to 
jk(/>  »  r  =>  U  then,  from  (7.6)  we  have 


*  =  B. 

Substituting  (7.6)  and  (7.7)  into  (7.5),  we  obtain 

It  follows  from  comparison  of  expressions  (7.4)  and  (7.8)  that 


‘'■a 


(7.7) 


(7.8) 
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Kotel'nikov  succeeded  in  concluding  the  mathematical  analysis  of  oscillation 
reception,  only  given  the  following  basic  assumptions: 

1.  The  signal-to-noise  ratio  is  sufficiently  slight. 

2.  Message  x(t)  will  not  comprise  frequencies  higher  than  some  frequency 
limit  Ft  ,  which  is  much  less  than  signal  carrier  frequency  fQ. 

3.  Receiver  structure  is  such  that,  when  there  is  no  noise,  the  receiver 
reproduces  the  message  absolutely  correctly;  the  structure  remains  unchanged  when 
there  is  noise. 


F 


The  concept  of  these  assumptions  will  be  examined  in  §  7.3. 

The  following  basic  results  were  obtained,  given  the  aforementioned  assumptions 


1.  Function  0  (t)  is  a  stationary  fluctuating  oscillation  with  a  normal 
law  of  distribution  with  a  zero  average  value.  Therefore,  (5  (t)  is  characterized 
fully  by  its  own  power  spectrum*  £*■(/)  and 


where 


P-\'El(f)df. 


Ft  —  lowest  frequency  contained  in  message  x(t). 


(7.10) 


2.  A  receiver  operating  on  the  maximum  inverse  probability  density  principle 
insures  receipt  of  the  minimum  mean  square  error  and  minimum  signal-to-noise  ratio 
at  receiver  output. 


3.  The  type  of  power  spectrum  £**(/)  will  depend  on  modulation  type.  /106 
Possible  modulation  types  are  categorized  as  direct  and  indirect.  Those  modulation 
types  in  which  signal  ux(t)  is  linked  with  message  x(t)  with  the  aid  of  some  type 
of  operator  (an  integral  or  differentiator,  for  instance)  are  referred  to  as  indirect 
modulation  types.  If  message  x(t)  is  included  directly  in  signal  u^(t) ,  rather 
than  under  the  sign  of  an  operator,  then  the  corresponding  modulation  type  is 
referred  to  as  direct  modulation. 


For  AM 


«*  (0  -  IM 1  +  m*  (/)  J  cos  (m,  <  +  9,). 


For  PM 


u,  (0  -0, cos  K  t  +  X  (0|. 


(7.11) 


(7.12) 


*It  is  assumed  that  spectrum  will  not  comprise  negative  frequencies. 
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For  FM 

«*(0  - (/,  cos  [(a,/  +  0  J.«  (/)  dt] .  (7.13) 

Consequently,  AM  and  PM  are  direct  modulation  types,  while  FM  is  an  indirect 


type. 


Those  indirect  modulation  types  in  which  x(t)  is  included  under  the  sign 
of  an  integral  are  referred  to  as  integral  modulation  types.  Consequently,  FM 
is  an  integral  modulation  type. 

For  direct  modulation  types  we  obtain 


Elif) 


N, 


([W)r 


(7.14) 


and  for  integral  types 


El  (!)> 


l[W)r  ’ 


(7.15) 


where  ta"*$x\t)dt 


The  formulas  have  a  slightly  more  complex  form  for  pulse  modulation  types 


Cl]. 


It  follows  from  formulas  (7.14)  and  (7.15)  that  the  noiae  power  spectrum 
at  receiver  output  in  the  case  of  direct  modulation  types  is  equidimensional  within 
a  band  range  of  (Fm  —  F,) ,  while  it  is  parabolic  for  integral  modulation. 

Substitution  of  expressions  (7.11)— (7.13)  into  formulas  (7.14)  and  (7.15) 
provide  the  following  results! 
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for  PM 


for  FM 


(7.17) 

/107 


**-%(¥?•  (7*ia) 

We  will  designate  ^mo  —  mean  square  noise  oscillation  value  at  receiver  output 


u—y?ra. 


It  follows  from  formulas  (7.9),  (7.10),  and  (7.19)  that 


(7.19) 


(7.20) 


where  UmaHJua  —  ratio  of  the  mean  square  noise  oscillation  value  to  maximum 
signal  oscillation  value. 

Substituting  expressions  (7.16),  (7.17),  and  (7.18)  into  formula  (7.20)  and 
assuming  for  simplicity  that  F*  s  0,  we  obtain: 


for  AM 
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(7.21) 


for  PM 


.VW.  . 
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mQ 


(7.22) 
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for  FH 


uma  VV^T,  2nF,  . 

u*a  "  U*  '  V3  Q  * 


(7.23) 


for  sinusoidal  FH 

Q=2nA/m  (7.24) 

and 


_"n  _  — ! —  (7.25) 

t/,  K3«* 

where 


is  the  Modulation  index,  while  Afm  —  frequency  deviation. 

Formulae  (7.21) — (7.23)  coincide  with  the  corresponding  formulas  for  real 
AM,  PM,  and  FM  signal  receivers. 

Note  1.  The  term  real  FM  (PM)  signal  receivers  here  and  in  future  is  understood 
to  mean  receivers  comprising  an  rf  amplifier,  amplitude  limiter,  frequency  (phase) 
detector,  and  af  amplifier.  The  term  real  AM  signal  receiver  is  understood  /108 
to  mean  a  receiver  comprising  an  rf  amplifier,  amplitude  (nonsynchronous)  detector, 
and  af  amplifier.  Formula  (7.21)  is  valid  for  a  real  AM  signal  receiver  with 
a  synchronous  detector  not  only  given  slight,  but  given  essentially  any,  signal- 
to-noiBe  ratio. 

Note  2.  Strictly  speaking,  relationships  (7. 21) —(7. 25)  occur  for  real  receivers 
only  in  the  event  that,  during  computation  of  noise  voltage  ,  the  signal 
is  assumed  to  be  unmodulated*  Therefore,  they  are  precise  only  given  a  slight 
signal  modulation  capability,  i.  e.,  given  slight  m,  «*  ,  and  m»  .  However, 


it  is  possible  to  consider  them  approximately  valid  also  for  large  a,  «+  ,  and 
m,  values. 

Consequently,  given  a  slight  signal- to-noise  ratio,  the  noise  immunity  of 
real  AM,  PH,  and  FM  signal  receivers  coincides  with  potential  noise  immunity. 

Hence,  it  follows  also  that  the  advantages  of  PM  and  FM  compared  to  AM  in  the 
case  of  optimum  receivers  are  identical  to  those  in  the  case  of  real  receivers — given 
slight  noise,  use  of  PM  and  FM  provides  an  advantage  that  is  greater,  the  greater 
and  m,  ,  respectively.  However,  the  greater  and  m , ,  then,  given 
less  noise,  anomalous  errors  arise  ameliorating  this  advantage. 

Given  PPM,  based  on  the  law  provided  in  §  6.2  we  obtain 


& <0- 


(7.27) 


where  Fm  —  pulse  repetition  frequency  [PRF3|  U*  —  effec'  Ive  signal  amplitude 
characterizing  ita  average  power  equalling 


If  the  minimally-pemisaible  PRF  is  selected,  that  being  Fn~  2 F,  ,  then 
formula  (7.27)  takes  the  form 


Elif)- 


Wf, 

u9% 


Considering  expression  (7.20)  and  assuming  that  Fn  s  0,  we  obtain 


(7.27a) 


uma  m  Vtt  (7.28) 

U*  U.  '  Q  * 


Comparing  formulas  (7.23)  and  (7.28)  and  considering  that,  for  FM 


it  is  easy  to  become  convinced  that,  given  identical  F ,  and  Q  and  equal 
effe  ive  amplitudes  Utt  and,  consequently,  equal  average  signal  powers  in  the 
case  of  PPM,  resultant  ratio  Uma/Uua  is  greater  by  a  factor  of  approximately 
2  than  is  the  case  for  FM.  Consequently,  potential  noise  immunity  is  somewhat 
better  for  FM  than  for  PPM. 


7.2  Impact  of  Signal  Modulation  Parameters 
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Kotel'nikov  obtained  all  formulas  presented  in  the  previous  section  in  the 
assumption  that  signal-to-noise  ratio  is  sufficiently  slight.  Here,  for  the 
modulation  types  examined,  the  noise  immunity  of  real  receivers  equals  potential 
noise  immunity  or  approximates  it  (given  good  limiters  and  proper  selection  of 
bandwidth  ahead  of  and  beyond  the  demodulator).  Therefore,  it  is  possible  in 
the  first  approximation  to  assume  that,  if  an  optimum  receiver  must  operate  always 
in  a  slight  signal-to-noise  ratio  mode,  then  its  structure  may  be  identical  to 
that  of  real  receivers,  i.  e.,  it  may  comprise  an  rf  amplifier,  demodulator,  and 
af  amplifier.* 


Let  the  ratio  of  the  effective  noise  and  signal  voltages  at  demodulator  input 
equal  UnJUc  .  Then,  in  order  for  the  §  7.1  formulas  to  be  valid,  it  is  necessary 
to  meet  the  condition 


(7.29) 


i.  e.,  the  signal-to-noise  ratio  must  be  slight  already  at  demodulator  input, 
not  only  at  its  output.** 


*Here  and  in  future,  the  term  rf  amplifier  is  understood  to  mean  all  linear 
stages  ahead  of  the  demodulator  (including  frequency  converters  as  well  in  the 
case  of  a  superhetrodyne  receiver),  while  the  term  af  amplifier  is  understood 
to  mean  all  linear  stages  connected  beyond  the  demodulator.  It  is  assumed  that 
the  amplitude  limiter  (for  FM  and  PM)  and  minimum  limiter  (in  the  case  of  pulse 
modulation)  are  part  of  the  demodulator. 

**In  the  case  of  an  AM  signal  receiver  with  synchronous  detector,  as  already 
noted  in  §  7.1,  there  ie  no  requirement  to  meet  condition  (7.29). 
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Permissible  signal-to-noise  ratio  Oma/UmQ  or  Uma/U^a  at  receiver  output 
is  set  during  receiver  design  since  this  is  precisely  what  determines  message 
reproduction  quality.  The  higher  the  levied  reproduction  quality,  the  lower  ratio 
Uma/UMQ  must  be.  However,  given  the  possibility  of  optimum  signal  parameter 
selection,  it  turns  out  that  condition  (7.29)  is  not  met  for  the  majority  of 
modulation  types,  even  if  requisite  ratio  Uma/UHa  at  output  is  very  slight. 

For  example,  we  will  examine  the  case  of  an  FM  signal. 

If  condition  (7.29)  is  met,  then  formula  (7.23)  is  valid.  Thus,  based  on 
this,  one  should  increase  signal  frequency  deviation  W2n  in  order  to  improve 
noise  immunity.  But,  both  rf  amplifier  bandwidth  P  (where  ”*•  >  1,  we  obtain 
fJ  2U'2n )  also  correspondingly  must  be  increased  here,  as  a  result  of  which  noise 
voltage  Um  rises.  Until  condition  (7.2 9)  is  met,  formula  (7.23)  is  valid  and, 
consequently,  it  is  advantageous  to  continue  to  increase  signal  frequency  deviation. 

A  further  parameter  Q  increase  may  turn  out  to  be  inadvisible  only  if  condition 
(7.29)  is  so  far  from  being  met  that  formula  (7.23)  validity  is  destroyed  and 
a  further  Q  increase  begins  to  lead  in  actuality  to  a  ratio  UmQIUm a  increase, 
rather  than  its  decrease.  /110 

Hence  it  follows  that  the  optimum  will  be  that  signal  frequency  deviation 
value  Q«BT/2n  at  which  condition  (7.29)  is  not  met  and  formula  (7.23)  a  fortiori 
is  imprecise,  even  if  the  signal-to-noise  ratio  at  output  must  be  slight.  Thus, 
for  example,  analysis  demonstrates  that,  in  the  case  of  sinusoidal  FM  where  modulation 
index  value  m,  providing  the  minimum  UmafUna  ratio  is  optimum,  this  ratio  is 
determined  already  not  from  formula  (7.23),  but  from  the  following  approximate 
relationship: 


uma  VEtfl  1 


(7.30) 


i.  e.,  formula  (7.23)  provides  an  error  of  almost  a  factor  of  2.  Consequently, 
formula  (7.23)  will  be  valid  for  FM  only  in  the  event,  along  with  meeting  the 
condition 
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for 


signal  modulation  index  "»*  will  be  significantly  less  than  value  m,  ftnT 
a  given  ratio  Uma/UMo  (the  less  UmalUua  t  the  greater  m»»nr  ). 

Analogously,  for  PM,  one  may  demonstrate  that  some  PM  coefficient  optimum 
value  out  exists,  as  does  signal  spectrum  bandwidth  value  2Si0„T/2n  for  PPM. 
Condition  (7.29)  is  not  met  where  the  values  of  these  parameters  are  optimum  and 
the  formulas  presented  in  §  7.1  are  imprecise  for  a  slight  signal-to-noise  ratio, 
even  given  very  slight  Uma!UMQ  ratios. 

For  AM,  ratio  Uma/UHa  will  depend  only  on  those  signal  parameters  (Uq  and 
m),  which  do  not  impact  upon  its  spectral  width.  Therefore,  optimum  signal  parameter 
selection  here  does  not  cause  a  condition  (7.29)  disruption. 

The  analysis  performed  permits  the  following  conclusions  to  be  made: 

1.  Formulas  (7.21) — (7.25)  are  sufficiently  precise  only  when  condition 
(7.29)  is  met,  i.  e. ,  for  a  slight  signal-to-noise  ratio  at  demodulator  input.* 

2.  Condition  (7.29)  is  not  met  for  all  the  modulation  types  examined  above, 
with  the  exception  of  AM,  given  optimum  signal  parameters,  even  for  a  very  slight 

U malUuQ  ratio,  and  formulas  (7.21) — (7.25)  may  provide  considerable  error.  Therefore 
in  such  a  case,  it  is  possible  to  use  the  aforementioned  formulas  only  for  a  rough 
approximation. 

3.  It  follows  from  point  2  that,  for  the  majority  of  modulation  types  (PM, 

FM,  PPM,  and  others),  the  formulas  presented  above  are  precise  only  for  the  /111 
corresponding  signal  parameter  values  leas  than  optimum,  i.  e. ,  when  the  following 
conditions  are  met,  respectively: 


a) 

For  FM 

b) 

For  PM 

*W»  'S.  OHM 

c) 

For  PPM 

Q«Oont. 

♦See  the  note  to  condition  (7.29). 


(7.31) 


7.3  Impact  of  Receiver  Frequency  Characteristic  Shape 


1.  General  Comments 

We  will  examine  the  concept  for  assumption  3  made  in  §  7.1  relative  to  optimum 
receiver  structure. 

In  accordance  with  this  assumption,  receiver  structure  for  slight  noise  must 
be  such  that  it  will  supply  no  message  x(t)  distortions  when  there  is  no  noise, 
while  the  structure  remains  unchanged  when  noise  appears. 

In  the  general  case,  one  may  consider  that,  given  noise,  the  resultant  error 
8P(/)  in  message  reproduction  comprises  two  components: 

6P(0 -«(/)+ «„(/),  (7*32) 

where  8a(/)  —  dynamic  error,  i.  e. ,  the  error  which  would  occur  when  there  is 
no  noise,  while  (5  (t)  —  additional  error  component  caused  by  noise  action  and 
referred  to  in  future  as  fluctuating  error. 

Error  6a(/)  was  referred  to  as  dynamic  because  it  is  caused  primarily  by 
the  irregularity  of  receiver  frequency  characteristics  (ahead  of  and  beyond  the 
demodulator),  i.  e.,  its  inertness,  and  it  rises  with  an  increase  in  the  rate 
of  message  x(t)  change,  i.  e.,  its  "dynamicity." 

In  accordance  with  the  Kotel'nikcv  assumption,  the  structure  of  the  optimum 
receiver,  given  slight  noise,  must  be  such  that  dynamic  error  6a(0  is  identical 
with  zero.  However,  this  assumption  is  not  always  valid. 

If  noise  intensity  is  infinitely  slight,  then  it  would  be  senseless  to  use 
that  receiver  structure  in  which  dynamic  error  is  not  identical  with  zero  since, 
in  this  case,  the  mean  square  of  resultant  error  6"?  would  be  a  fortiori  greater 
than  for  the  receiver  structure  which  insures  6B(t)  —  0  .  But,  given  infinitely- 
slight  noise  intensity,  the  task  of  insuring  noise  immunity  becomes  completely 
senseless.  If  noise  intensity  is  only  very  slight,  but  finite,  then  it  may  /112 
tum  out  to  be  advisible  to  narrow  the  bandwidth  (ahead  of  and  beyond  the  demockj- 
lator),  even  if  some  dynamic  errors  result. 
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Actually,  narrowing  the  frequency  band,  causing  a  rise  in  dynamic  error, 
simultaneously  decreases  the  fluctuating  error;  therefore,  the  magnitude  of  the 
resultant  error  here  may  decrease.  Consequently,  even  given  very  slight  noise, 
it  is  necessary  in  the  general  case  to  strive  to  select  receiver  rf  and  af  frequency 
characteristics  (i.  e.,  ahead  of  and  beyond  the  demodulator)  to  insure  minimum 
resultant  error  6p  . 

For  simplicity,  we  will  evaluate  the  errors  based  on  their  mean  squares. 

Then,  the  optimum  receiver  will  be  the  one  insuring  minimum  mean  square  of 
the  resultant  error. 

Consequently,  it  is  necessary  to  select  receiver  rf  and  af  frequency  char¬ 
acteristic  shapes  from  the  minimum  mean  square  of  the  resultant  error  6^*  in  message 
x(t)  reproduction. 

2.  Connection  of  Optimum  Linear  Filters 

Initially,  we  will  examine  the  methodology  for  finding  the  optimum  shape 
of  the  receiver  af  frequency  characteristic  (i.  e.,  beyond  the  demodulator). 

Here,  we  will  assume  for  simplicity  that  the  rf  bandwidth  is  so  broad  that 
no  dynamic  errors  arise  in  the  rf  stages. 

Also,  we  will  assume  that  condition  (7.29)  is  met.  Then,  the  assumption 
of  the  possibility  of  dynamic  errors  in  af  stages  is  the  only  difference  between 


Figure  7.1.  (a)  —  Optimum  linear  filter  (OLF). 

the  optimum  receiver  examined  and  the  optimum  Kotel'nikov  receiver.  Therefore, 
such  a  generalized  optimum  receiver  must  comprise  receiver  /7„  ,  optimum  in  the 


Kotel'nikov  sense,  and  optimum  linear  filter  (OLF)  connected  at  its  output  (Figure 
7.1). 


The  OLF  must  insure  receipt  of  the  minimum  mean  square  error  in  reproduction 
of  usable  oscillation  ua(0  ,  distorted  by  additive  noise  u,bq(/)  ,  arriving  at 
its  input.  Such  a  filter  was  examined  in  §2.2  for  a  case  when  «o(0  is  a  stationary 
random  process.  For  simplicity,  we  will  not  burden  this  filter  with  the  condition 
of  physical  realization.  In  addition,  we  will  assume  that  usable  message  /113 
«o(/)  and  fluctuating  error  component  uma(()  statistically  are  independent  [the 
validity  of  such  an  assumption  is  greater,  the  less  the  signal  is  capable  of  being 
modulated  by  message  x(t)].  Then,  the  filter  frequency  characteristic  must  meet 
condition  (2.19),  which  may  be  written  in  the  following  form: 


«+£'■  (/)/£*,(/)  ’ 


(7.33) 


where  EUf)  and  ££,(/)  —  power  spectra  of  oscillations  x(t)  and  (5  (t),  respectively, 
at  Kotel'nikov  receiver  output.  Here,  it  is  assumed  that  message  x(t)  may  be 
considered  a  stationary  random  process. 


Since 


uq  (0  —  C/m  a  x  (t)  and  uma  (0  =  Uu  a  6  (i), 

then,  the  power  spectra  of  oscillations  uq(1)  and  ubb(0  equal  U\aE\(f)  and 
U\q  £*,(/)  ,  respectively.  Therefore,  formula  (2.20)  for  the  mean  square  error 
at  filter  output  may  be  written  in  the  following  form: 


Sam - df, 

f  (/>/£«•</> 


(7.34) 


where 


e  (/)  *■  Kim  (/)  —  Mg  (0  "  «»o(0* 


(7.35) 


163 


.1 


It  is  accepted  in  formula  (7.34),  as  opposed  to  (2.20),  that  spectra 
£,'(/)  and  £„*(/)  will  not  comprise  negative  frequencies. 

It  follows  from  formulas  (7.33) — (7.33)  that 


(7.36) 


where 


U’m0  ~  VT*  • 


(7.37) 


Formula  (7.36)  determines  the  relative  mean  square  error  at  a  generalized 
optimum  receiver  output.  One  may  make  the  following  conclusions  from  comparison 
of  formula  (7.36)  with  the  corresponding  formula  (7.20)  obtained  for  the  optimum 
Kotel'nikov  receiver: 

1.  There  is  no  requirement  to  know  the  high  and  low  frequency  of  the  message 
x(t)  spectrum  for  a  generalized  optimum  receiver  as  is  the  case  for  the  optimum 
Kotel'nikov  receiver.  This  is  more  convenient  since,  in  many  actual  cases,  message 
spectrum  £»*(/)  does  not  have  sharply-defined  cutoff  in  the  region  of  the  /114 
high  and  low  frequencies  and  it  is  unclear  what  £*  and  £a  in  formula  (7.20) 
should  be  understood  to  mean. 

2.  Even  if  frequencies  Fu  and  £a  are  precisely  known,  one  obtains 


i  --  •'■a 
'  V • 


(7.38) 


i.  e.,  connection  of  the  optimum  linear  filter  at  receiver  output  may  provide 
a  decrease  in  relative  message  reproduction  error.  This  decrease  is  all  the  more 
significant,  the  greater  the  signal-to-noise  ratio  and  the  longer  the  curve 
£**(/)  "tails”  in  the  region  of  high  and  low  frequencies. 

Examples  illustrating  these  postulations  are  presented  in  [123]. 
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PART  THREE  /115 

OPTIMAL  RECEPTION  OF  SIGNALS  WITH  RANDOM  PARAMETERS  (ANALYSIS  USING  THE  INVERSE 
PROBABILITIES  APPROACH) 

CHAPTER  EIGHT 

GENERAL  RELATIONSHIPS 

8.1  Problem  Formulation 

In  the  theory  presented  in  Part  II,  Kotel'nikov  assumes  that  signal  u^Ct) 
is  precisely  known,  while  the  best  receiver  is  the  one  which  selects  the  message 
x  value  with  maximum  inverse  probability  P^(x)  as  the  true  value.  Therefore, 
the  Kotel'nikov  approach  may  be  called  the  maximum  inverse  probability  approach, 
applicable  to  precisely-known  signals. 

Following  publication  of  Kotel'nikov' a  work  in  1946  [1],  the  theory  of  optimum 
reception  methods  developed,  first,  through  examination  of  signals  with  random 
|  parameters  and,  second,  through  generalization  of  receiver  optimization.  The 

first  important  results  along  these  lines  were  obtained  in  the  works  of  Woockrard 
and  Davis  [2,  28,  aid  others]. 

As  demonstrated  in  §  5.1,  for  given  signal- plus-noise  y(t),  maximum  usable 
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information  on  message  x  achievable  at  the  point  of  reception  is  inverse  probability 
distribution  Py(x).  Therefore,  the  job  of  the  optimum  receiver  is  to  compute 
distribution  Py(x).  A  decision  on  what  message  x  was  must  be  made  on  the  basis 
of  analysis  of  this  distribution  type. 

A  decision  may  be  made  in  the  simplest  case  from  the  maximum  inverse  probability 
principle  that  Kotel'nikov  used.  However,  in  several  instances,  the  maximum  inverse 
probability  principle  is  not  best,  as  even  Kotel'nikov  demonstrated.  Therefore, 
in  the  general  case,  it  is  possible  to  confirm  only  that  the  optimum  receiver 
must  decide  (using  some  specific  method)  based  on  analysis  of  inverse  probability 
distribution  Py(x).  This  general  principle  of  finding  the  optimum  decision  will 
be  called  the  inverse  probability  approach.  Then,  it  is  possible  to  consider 
the  maximum  inverse  probability  criterion  one  specific  variant  of  the  overall 
approach. 

Examples  of  using  the  inverse  probability  approach  to  solve  various  /116 
problems  of  receiving  signals  with  random  parameters  in  the  presence  of  additive 
random  noise  um{t )  are  examined  in  subsequent  chapters  (9-13).  Basic  analysis 
will  be  performed  assuming  that  this  is  normal  white  noise.  However,  in  §  12.9, 
results  obtained  are  generalized  for  random  noise  with  an  irregular  power  spectrum. 

As  was  the  case  in  Part  II,  analysis  initially  will  be  performed  for  discrete 
messages,  then  we  will  look  into  receipt  of  individual  analog  message  values  and 
reception  of  oscillations.  Here,  considerably  more  attention  is  devoted  to  different 
signal  detection  cases,  i.  e.,  actually  establishing  whether  or  not  a  signal  is 
present  at  receiver  input.  This  is  because,  in  recent  years,  the  theory  of  optimum 
reception  methods  began  to  be  used  widely,  not  only  in  radio  communications,  but 
also  in  radar,  radio  control,  and  radio  astronomy,  i.  e. ,  in  fields  where  reliable 
signal  detection  is  just  as  vital  as  is  precise  reproduction  of  messages  these 
signals  carry. 

Initially,  we  will  examine  simple  binary  detection,  i.  e.,  detection  of  the 
only  possible  signal.  Then,  we  will  look  into  complex  binary  detection,  i.  e, 
detection  of  one  of  many  possible  signals  (there  is  no  requirement  here  to  determine 
exactly  which  of  m  possible  signals  is  present.  It  suffices  to  establish  that 
one  possible  signal  actually  is  present). 


Simple  binary  detection  in  radar  corre8pond8  to  detection  of  an  object  (aircraft) 
in  one  predetermined  elementary  cell  of  space.  If  the  requirement  is  to  establish 
whether  an  object  is  present  in  any  one  of  m  elementary  cells  (without  specifying 
exactly  which  one),  then  this  task  equates  to  complex  binary  detection  (in  future, 
for  brevity,  the  word  "simple"  often  is  omitted  from  the  term  "simple  binary  detec¬ 
tion,"  but  the  word  "complex"  always  is  retained  when  we  refer  to  complex  binary 
detection. 

Simultaneous  detection  and  recognition  (distinction)  of  m  possible  signals 
are  examined  in  Chapters  11  and  12. 

Analog  message  reception  is  examined  in  Chapter  13.  However,  here  the  ter¬ 
minology  used  differs  somewhat  from  the  Kotel'nikov  terminology  used  in  Part  II 
of  this  book. 

Kotel'nikov  described  the  parameter  of  the  signal  carrying  usable  message  x 
in  the  form 

+  *"*)*  (8.1) 

where  x  —  normalized  message,  i.  e.,  dimensionless  magnitude  in  the  range  of 
+1.  For  example,  for  AH 

a-OtO  +  mx), 

where  a  —  signal  amplitude,  while  m  —  percent  modulation. 

An  (8.1)  type  entry  is  convenient  and  natural  in  the  case  of  radio  /117 
communications,  when  message  x  is  transmitted  by  means  of  the  corresponding  modulation 
of  the  signal  parameter.  In  the  case  of  radar,  radio  astronomy,  radiometry,  and 
in  several  other  cases,  signal  modulation  type  by  the  message  may  not  be  selected 
according  to  our  wishes  and,  during  reception,  there  usually  is  a  requirement 
to  measure  the  magnitude  of  this  or  that  parameter  of  a  signal  carrying  a  usable 
message.  Thus,  for  example,  there  is  a  requirement  to  measure  signal  amplitude  a 
or  moment  of  arrival  T  (computed  from  some  base  time)  or  its  frequency  f.  In 


these  cases,  measured  perameter  *1  ,  rather  than  one  of  its  elements  x,  included 
in  expression  (8*1)  may  be  considered  a  usable  message. 


If  modulation  parameters  and  m  are  known*  (or  may  be  considered  known), 
then  measurement  of  parameter  n.  is  fully  equivalent  to  measurement  of  normalized 
message  x*  Here,  transition  from  measurement  of  parameter  n  to  measurement  of 
normalized  message  x  and  vice  versa  will  be  accomplished  easily  by  using  relationship 
(8.1). 

Thus,  for  example,  it  follows  from  this  relationship  that,  if  Py(x)  s  f^x) 
and  Py  (  n  )  *  f^(  n  )  --  inverse  probability  densities  of  magnitudes  x  and 
n  ,  then 


and 


p,  to  -  +"»)]  -m%M<k(l  +  «ur)| 

M— -')• 

/  m«ta  \m%  / 


(8.2) 


According  to  these  formulas,  knowing  distribution  Py(x),  it  is  simple  to  find 
distribution  Py(  r>  )  and  vice  versa.  Further  exposition  mainly  will  apply  to 
a  non-no realized  message;  however,  in  future  it  often  is  designated  x,  rather 
than  n.  . 

In  several  cases,  there  may  be  a  requirement  for  simultaneous  measurement 
of  several  signal  parameters.  For  example,  if  it  is  necessary  to  measure  both 
range  to  an  airborne  aircraft  and  its  radial  velocity,  then  the  task  boils  down 
to  simultaneous  measurement  of  the  time  a  signal  reflected  from  an  aircraft  arrives 
and  signal  frequency.  An  analogous  task  arises  also  in  the  case  of  multichannel 
communications,  i.  e.,  during  simultaneous  transmission  of  several  different  messages 


*It  follows  from  (8.1)  that  n,  -  and  m-  .  therefore, 

2  2 

if  limits  %nm  and  of  perameter  n.  change  are  known  (and  finite),  then 

magnitudes  n  and  m  also  are  known. 


Measurement  of  just  one  parameter  is  examined  in  Part  III  for  the  purpose  of 
simplicity.  The  special  features  of  simultaneous  measurement  of  two  or  more 
parameters  are  analyzed  in  Part  IV  of  the  book. 

8.2  Methodolgy  of  Computation  of  Inverse  Probabilities  /118 

F inding  distribution  Py(x)  for  precisely-known  signals  was  examined  in  the 
preceding  part  of  the  book.  We  will  explain  which  special  features  in  P^(x)  com¬ 
putation  arise  when  the  signal  has  parasitic  random  parameters  Ct  p  Ct  2* 


Initially,  for  simplicity,  let  the  signal  have  only  one  parasitic  random 
parameter  Ct  .  Here,  in  accordance  with  (1.9) 

y(t)  “«*.«(/)  +  </„(/).  (8.3) 

where  um(t)  —  additive  noise  with  known  a  priori  distribution  W m(um)  . 

In  an  analogy  with  relationship  (4.6),  in  this  case  it  is  possible  to  write 

P,(x,  a)=*kP(x,  a)P,.a(y),  (8*4) 

since  now  the  signal  has  two  random  parameters  (x  and  Ct  ),  rather  than  one. 

In  this  expression,  P(x,  CC  )  —  a  priori  joint  distribution  x  and  Ct  ,  k  —  constant 
factor  (not  dependent  on  x  and  Ct  ),  which  may  be  determined  from  normality  condition 

S  j  Pr(x,a)d*d<*~l,  (8<5) 

where  Ax  and  —  fields  of  all  possible  values  of  parameters  x  and  Ct  ,  res¬ 
pectively,  Py(x,  Ct  )  —  probability  (probability  density)  that,  for  given  reali¬ 
zation  y(t),  signal  random  parameters  equal  x  and  CC  ,  respectively. 

We  are  interested  in  probability  Py(x),  i.  e. ,  the  probability  that,  for 
given  realization  y(t),  the  message  equals  x;  here,  parasitic  parameter  Ct  may 
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have  any  value.  Therefore,  F(x)  determination  requires  integration  of  function 
Py(x,  CC  )  with  respect  to  all  possible  parameter  CC  values: 


P,  (Jf)  ==  $  P,  (*.  a)  da. 


(8.6) 


It  follows  from  formulas  (8.4)  and  (8.6)  that  finding  distribution  Py(x) 


requires  knowing  functions  Py(x,  CC  )  and  Px  ^  (y). 

In  the  general  case 

P(x,a)=P(x)P,(a)i 

if  x  and  Ct  statistically  independent,  which  usually  is  the  case,  then 

P (Jf.  a)  =P(x) P(a),  (8.8) 


(8.7) 


where  P(x)  and  P(  Ct  )  —  a  priori  distributions  of  message  x  and  parasitic  /119 
parameter  Ct  ,  respectively,  which  are  assumed  known  or  which  must  be  given 
due  to  certain  circumstances  (see  Chapter  19).  Consequently,  distribution 


Px  ^  (y)  remains  to  be  found. 


If  additive  noise  um(t)  has  no  statistical  coupling  with  parameters  x  and 
CC  (which  usually  is  the  case),  then  it  follows  from  (8.3)  that  probability 


P  ^  (y)  that  total  oscillation  equals  y  for  given  x  and  CC  is  determined  from 
e^c 


the  following  relationship: 


P*.*(y)-WmlyU)— «*. « (01. 


(8.9) 


i.  e.,  distribution  Px  (y)  is  obtained  from  noise  distribution  Wmiti^iby  replacement 
of  argument  uu  by  difference  (y— ux  q  ).  Thus,  for  example,  for  normal  white 
noise,  in  accordance  with  formulas  (1.25)  and  (8.9),  we  obtain 


P*,*(y)' 


(|’  TnFiy 


■eap[— 1 ly(0— ««.«(01*  d/ 


(8.10) 
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Thus,  In  a  case  of  signals  with  random  parameters,  distribution  Py(x)  computation 
requires  that : 

1.  Distribution  ^  (y)  with  respect  to  formula  (8.9)  be  computed  with 
respect  to  known  additive  noise  distribution  Wm(um)  . 

2.  Distribution  Py(x,  CC  )  be  computed  with  respect  to  formulas  (8.4)  and 
(8.7)  [or  (8.8)]. 

3.  Desired  distribution  P^(x)  be  computed  with  respect  to  formula  (8.6). 

After  distribution  Py(x)  is  found,  further  mathematical  analysis  is  the  same 
as  analysis  of  a  precisely -known  signal. 

For  example,  if  the  optimum  receiver  operates  on  the  maximum  inverse  probability 
principle,  then  oscillation  Y  (t)  at  receiver  input  is  determined  by  relationship 
(4.2). 

If  the  signal  has  n  parasitic  parameters  OC  2,  .  .  .  #  GC  rather 

than  one  parameter,  then,  analogous  to  the  relationships  presented  above,  the 
result  is 


P,(*)=  ^  ***  5  . ®»Mai  ••• 


(8.11) 


where 


Py  (jfj  Otji  «.««  fl,)  * (^1  ••••  ®n)  P *\  ®*.  •••• 

Pm;  . ( y )  -  I*  W— « (*;  «« . «■:.  «l. 


(8.12) 


(8.13) 


Given  independent  x,  CC  ,  .  .  .,  GC  ^ 


/>(*;  o, . ®„)  =  P(x)P («i)  -  P (»«)• 


(8.14) 


171 


CHAPTER  NINE 


/120 


BINARY  DETECTION 

9.1  Computation  of  the  Inverse  Probability  of  a  Random-Phase  Signal 
Let 

(9.1) 

where 


ui.'  (/)-acos(w/-h*).  (9.2) 

We  will  compute  distribution  Py(a)  of  signal  amplitude  inverse  probability 
densities,  assuming  frequency  <o  is  known,  while  phase  9  is  random  and  equally 
probable,  i.  e. 

P  ~  •  where  f  *0  +  2a; 

P( 9)-»0  outside  these  limits  (9.3) 


Amplitude  a  has  known  a  priori  distribution  P(a).  Amplitude  a  and  phase 
<p  are  assumed  to  be  analog  random  magnitudes,  i.  e.,  during  observation  cycle 
(0,  T),  they  are  constant,  but  random,  and  have  probability  densities  P(a)  and 


P(  4>  ).  Distribution  P^(a)  computed  in  this  section  in  future  will  be  used  both 
for  solution  of  the  binary  detection  problem  and  for  signal  amplitude  measurement. 

During  distribution  Py(a)  computation,  amplitude  a  plays  the  role  of  desired 
usable  signal  x,  while  phase  $  is  a  parasitic  random  parameter.  Therefore, 
general  formulas  (8.11)— (8.14)  in  this  case  have  the  form 


ft 


N 


in 

?)«*»• 

0 


where 


P'.* 


P„(a,  (p)-AP(a,  (if); 

“■iW'’'  [~i/ Ml**]  I 


P(a,  <p)-P(a)P(<p) 

(a  and  <t>  are  assumed  to  be  statist ically-independent  magnitudes). 

Expanding  the  expression  (9.6)  parentheses  under  the  integral  and 
considering  relationship  (9.2),  we  obtain 


P«,  *  (if)  *■ 


•  _  b 


(V&iR)* 


-r  f  »•<»>«  a»r 

e-5^en 


where 


n(<*. 


y(/)  cos  (a/  +  <p)<ft. 


From  relationships  (9.3)— (9.5),  (9.7),  and  (9.8),  we  obtain 

••r  Jr 


P,{o)-klP(a)t~vr'  J 
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(9.4) 


09.5) 

(9.6) 

(9.7) 
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(9.8) 


(9.9) 


(9.10) 


where  k,  —  constant,  which  includes  all  factors  not  dependent  on  variable  a. 
It  follows  from  (9.9)  that 


H  (a,  <p)  *.  =■  (X  cos  «p — Y  sin  <p), 


(9.11) 


where 


r  r 

X-J  y(f)cas»tdt\  $  y(t)  Sin  at  dt. 


(9.12) 


To  simplify  expression  (9.10)  integration,  we  will  introduce  new  variables 
M  and  6  so  that 


X«Afcos0;  K-M*in0;i.  e.  Al  =  />(»+*'». 


(9.13) 


Then,  expression  (9.11)  takes  the  form 


H  (a.  <p)  ~  M  cos(0+  9), 


(9.14) 


and  the  integral  included  in  formula  (9.10)  easily  will  be  removed: 


«'r  3*  *•*  eo«  c*+  •) 

P,(a)-*,P(a)e  ^  J  e"^  d? 

*»r 

-*P<a)e~W!l.(5£). 


where  I  (z) 
o 


modified  Bessel  function. 


(9.15) 


Since  it  is  important  to  know  only  the  curve  Py(y)  shape,  and  its  scale  plays 
no  principal  role,  then  it  is  possible  to  assume  that  kj  =  1  in  (9.15).  In  addition, 
it  often  is  more  advisible  during  computational  device  realization  to  compute  /122 
In  Py(a)  rather  than  Py(a).  Therefore,  instead  of  (9.15),  it  also  is  possible 
to  write 


(9.15a) 


r. 

r. 

h 

s 

i, 

Lr.  • ■  -  • 
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lnPy(a)-lnP(a)-|~  +lnl,^). 

It  follows  from  expression  (9.15)  that  magnitude  M  must  be  computed  prior 
to  determination  of  desired  distribution  P^(a).  In  the  general  case,  this  parameter 
may  be  computed  using  formulas  (9.12)  and  (9.13). 

A  functional  diagram  of  the  corresponding  computational  device  is  depicted 
in  figure  9.1.  It  will  comprise  generators  A  and  A  ,  which  generate  the  sinu- 


Figure  9.1.  (a)  —  KOR  [correlator];  (b)  —  BM  [balanced  modulator]; 

(c)  —  D  [detector];  (d)  —  G  [generator]. 

soidal  voltages  of  signal  »  frequency  and  some  random  supplemental  frequency 
q  ,  correlators  KOP,  and  KOPt  ,  balance  modulators  BM,  and  BMt ,  which 
perform  multiplication  operations,  aryj  linear  amplitude  detector  A'  ,  which 
separates  the  total  oscillation  X  cos  fl/  +  Y  sin  Uf  envelope.  The  magnitude  of 
this  envelope  also  equals  desired  parameter  M  (all  constant  factors  in  Figure 
9.1  for  simplicity  are  assumed  to  equal  unity). 

A  computational  device  corresponding  to  the  Figure  9.1  circuit  is  relatively 
complex.  However,  given  several  assumptions  which  usually  are  the  case,  it  msy 
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be  replaced  by  an  optimum  linear  filter,  which  provides  maximum  signal-to-noise 
ratio  and  which  is  described  in  §  2.3. 


Actually,  it  follows  from  formulas  (9.9)  and  (9.14)  that  M  is  the  envelope 
of  oscillation  gjq(a,  9)  ,  considered  a  function  of  <p  ,  i.  e.,  the  envelope 
of  function 


T 

P  (»P)  —  $  y  (/)  cos  (oi/  +  q>)  dt. 
o 


(9.16) 


But,  it  was  shown  in  §  4.3  that,  if  signal  uc(t)  lasts  only  during  /123 
interval  (0,  T),  then  optimum  filter  output  voltage  at  moment  T  equals 


r 

“»m*  (T)  **const  $  y (t)ut(t) dt.  (9.17) 

o 

Here,  optimum  is  understood  to  mean  a  filter  matched  with  signal  u  (t),  i.  e., 

C 

having  impulse  transient  characteristic  equalling 

q  (0 -const  0. 


where  t  =  T. 
o 

Comparing  expressions  (9.16)  and  (9.17),  it  is  easy  to  become  convinced  that 
fH<p)  coincides  (precise  to  a  constant  factor)  with  the  voltage  magnitude  (at  moment 
T)  at  the  output  of  an  optimum  linear  filter  matched  with  signal  u  (t).* 

a 

In  accordance  with  (9.16),  this  voltage  is  a  function  of  rf  occupation  phase 


*1rfe  considered  that  the  filter  must  be  matched  with  a  signal  in  the  form 

coral  -cos  (ut  +  <f)  .  Phase  $  of  this  signal  is  random  and,  consequently,  is  unknown. 

But,  since  the  requirement  is  reproduction  of  just  the  envelope  of  the  output 

voltage  of  this  filter,  then  any  phase  4>  ,  such  as  $  s  0,  may  be  selected 
during  filter  design. 


♦  .  But,  magnitude  M  is  the  function  0(«r)envelope.  Therefore,  it  is  proportional 
to  the  value  of  the  envelope  (at  moment  T)  of  the  optimum  linear  filter  output 
voltage. 

It  is  evident  that  confirmation  is  sufficiently  precise  if  the  envelope  exists 
at  filter  output,  i.  e.  if  <*T  >  I,  and,  during  observation  cycle  T,  at  least 
several  rf  occupation  periods  accumulate.  This  condition  usually  is  met. 

Consequently,  as  the  optimum  receiver  is  realized,  magnitude  M  included  in 
expression  (9.15)  usually  may  be  obtained  by  oscillation  y(t)  passage  through 


Figure  9.2.  (a)  — *  OF  [optimum  filter]; 

(b)  —  D  [detector]. 

matched  optimum  linear  filter  00  and  separation  of  the  oscillation  envelope 
at  filter  output  (at  moment  T)  with  the  aid  of  amplitude  detector  H  (Figure 
9.2). 

Realization  of  the  remaining  operations  required  to  compute  function  In  Py(a) 
in  accordance  with  (9.15a)  does  not  encounter  major  difficulties.  In  particular, 
the  most  complicated  of  these  operations,  computation  of  an  In  I  (z)  type  function, 
may  be  accomplished  by  using  a  conventional  thermionic  diode  detector.  Actually, 
it  was  shown  in  [112]  that  a  conventional  diode  detector  (thermionic  diode)  for 
large  load  impedances  (R  >10—50  kilohms)  has  a  response  curve  of  the  type 

Al/_-±lnl,(ft(/m).  (9.18) 

v 

where  —  sinusoidal  oscillation  amplitude  at  detector  output;  At/.  —  rise  /124 
in  the  direct  detector  output  wltage  component  caused  by  onset  of  an  oscillation 
with  amplitude  U  ; 


I 

! 

t* 

i 

i 
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6  =  const «  10J/B. 


The  greater  detector  load  impedance  R,  the  more  precisely  its  response  curve 
is  determined  by  expression  (9.18). 

Thus,  realization  of  the  optimum  receiver  computing  inverse  probability  Py(a) 
or  its  logarithm  does  not  encounter  major  difficulties. 

9.2  Binary  Detection  of  a  Random  Initial  Phase  Signal 

Let  the  signal  at  receiver  output  either  be  absent  or  have  the  form 

ue(i)  »a,cos(a>/+q>),  (9.19) 

where  amplitude  aQ  and  frequency  u  are  precisely  known,  but  phase  <f>  is  random 
and  equally  probable,  i.  e.,  distributed  with  respect  to -law  (9.3).  Evidently, 
solution  of  the  problem  of  detecting  this  signal  equates  to  the  response  to  the 
question  of  which  of  two  possible  values  of  amplitude  a  occur  during  a  given  obser¬ 
vation  cycle:  a  =  aQ  or  a  a  0. 

In  accordance  with  the  inverse  probability  approach,  answering  this  question 
requires  comparison  between  themselves  of  inverse  probabilities  Py(aQ)  and  Py(0) 
of  the  corresponding  amplitude  values. 


1 


It  follows  from  general  expression 


(9.15)  that 


p,( 0) -*,/>«». 


1 


(9.20) 


If  the  maximum  inverse  probability  principle  is  used,  then  the  value  of  the 
amplitude  with  the  greatest  inverse  probability  should  be  considered  the  decision. 
In  this  case,  this  denotes  that  a  signal  present  decision  (aQ  =  a)  must  be  used 
f  if  it  turns  out  that 

•:  ns 


? 


Pv  (a9)  >  Py  (0). 


(9.21) 


and  the  signal  absent  decision  (a  =  0)  in  the  opposite  case,  i.  e. 

M«p.<o>-  (9*22) 

It  follows  from  (9.20)  and  (9.21)  that  a  "yes"  decision  ("signal"),  i.  e., 

(aQ  =  Q)  must  be  used  when  satisfying  inequality 


(9.23) 


where 
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P(  0) 
f  («•) ' 


(9.24) 


Consequently,  the  optimum  receiver  must  compute  magnitude  In  IQ(2aQM/No) 
and  compare  it  with  some  threshold  Uq.  The  response  "yes"  (signal)  must  be  supplied 
if  the  threshold  is  exceeded,  with  the  response  "no"  (noise  only,  no  signal)  provided 
in  the  opposite  case. 


Here,  determination  of  parameter  M  included  in  expression  (9.23)  may  be  ac- 
complished  as  indicated  in  the  preceding  section,  i.  e.,  through  use  of  the  Figure 


9.1  computational  device  or  with  the  aid  of  an  optimum  linear  filter  (Figure  9.2). 
In  the  latter  case,  the  optimum  detector  circuit  has  the  form  shown  in  Figure 
9.3. 
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If  Uj  >  UQ  (i.  e. ,  i/iH,  .>•  0)  is  obtained  at  detector  A  output  at  moment 
t  s  T,  the  response  is  "yes,"  while  it  is  "no"  in  the  opposite  case.* 

Instead  of  a  detector  with  an  In  IQ(z)  curve,  one  with  any  other  monotonic 
response  curve  may  be  used  in  the  Figure  9.3  circuit,  if  threshold  UQ  magnitude 
is  adjusted  accordingly. 

Actually,  if  voltages  and  UQ  are' subjected  to  monotonic  nonlinear  trans¬ 
formation  with  respect  to  law  U'  =  f(U)  (Figure  9.4),  then,  given  any  (but  monotonic) 


o  v,  v, 


Figure  9.4 

type  of  this  transformation,  inequalities  >  UQ  and  U^C;  UQ  will  be  transformed 
into  ineqimlities  U'  >  UQ'  and  U^'  <  Uo’,  respectively.  Therefore,  such  a  trans¬ 
formation  does  not  impact  upon  solution  of  the  signal/no  signal  problem.  Conse¬ 
quently,  when  response  curve  shape  changes,  accompanied  by  the  corresponding  threshold 
Uq  change,  signal  detection  error  probabHties  will  remain  unchanged. 

As  pointed  out  in  §  5.2,  two  types  of  errors  occur  in  binary  detection — 
false  alarms,  with  probability  P*  *  ,  and  signal  misses,  with  probability  /126 
P„  .  Here,  composite  error  probability  equals 

P„-P(0)  Piit  +  P  (°fl)  P«v  (9*25> 

The  methodology  for  computation  of  error  probabilities  Pm  and  P „p  in  this 
case  is  analogous  to  that  presented  in  §  5.2. 

*In  Figure  9.3  and  subsequently ,  the  device  clipping  detector  output  voltage 
at  moment  t  s  T  is  assumed  to  be  included  in  unit  a  . 


We  will  examine,  for  example,  computation  of  probability  P«  ». 


It  follows  from  (9.23)  that  false-alarm  probability  equals  the  probability 
of  inequality  (9.23)  satisfaction  for  an  absent  signal,  i.  e.,  when  only  noise 
is  present. 

Inequality  (9.23)  may  be  written  in  the  form 


2a,  M 

N. 


>r. 


where  z  is  determined  from  the  relationship 


In  !,<*)  =  (/,. 


(9.26) 


(9.27) 


It  follows  from  (9.13)  and  (9.26)  that  a  false  alarm  will  occur  when  the 
following  inequality  is  satisfied: 


(9.28) 


where 


I1- 


(9.29) 


It  follows  from  (9.12)  that,  for  a  signal,  i.  e.,  when  y(t)  s  «■(/)  ,  these 
relationships  occur 


T 

¥rx~7T  f  «■ (0«iCo»  <*•'. 

N*  ,  N*  i 

r 

j  um{t)a,i\T\»tdt. 

r 

It  was  demonstrated  in  §  5.2  that  random  magnitude  —  $ 


(9.30) 


has  a  normal  law  of  distribution  with  a  zero  mean  value  and  dispersion  2Q/Nq, 

T 

where  Q  =  $  «c*  (0  . 

o 

2o  2c 

Therefore,  random  magnitudes  ^  X  and  ^  Y  ,  determined  by  expressions 
(9.30),  also  have  normal  laws  of  distribution  with  zero  mean  values  and  identical 
dispersions  equalling 


(9.31) 


where 
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Q.= 


afT 
2  ' 


(9.32) 


In  addition,  considering  orthogonality  of  functions  cos  o>/  and  sin  <*>  t 
included  in  expressions  (9.30),  it  is  easy  to  become  convinced  that  magnitudes 
A'  and  ^  Y  are  statistically  independent.  Consequently,  magnitude  t  ^  is 
the  sum  of  the  squares  of  two  normally-distributed  independent  random  magnitudes 
with  zero  mean  values  and  identical  dispersions  o*  .  Such  a  random  magnitude 
is  subordinate,  as  is  known,  to  distribution  X*  with  two  degrees  of  freedom, 
ie  6«| 


P  (t|* >  z1) 


(9.33) 


2  2 

But,  it  follows  from  (9.28)  that  P(  n  >  z  )  is  nothing  but  a  false-alarm 

probab ility .  Therefore 


P»  T-e-*v»a* 


or,  considering  (9.31) 


(9.34) 


—  "V  -  ■-*  -  .W  .M 
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where,  in  accordance  with  formulas  (9.24),  (9.27),  and  (9.32),  magnitude  z  is 
determined  by  the  relationship 


to 4  +  1  (,.35) 

It  follows  from  formulas  (9.34)  and  (9.35)  that  it  suffices  for  computation 
of  a  false-alarm  probability  to  know  signal- to- noise  ratio  Qq/Nq  and  ratio  P(0)/P(aQ) 
of  a  priori  signal/no  signal  probabilities  (or  threshold  UQ). 

Miss  probability  ^np  is  determined  by  expression 


Pm-l-frM]  *e -<*./«?.)*•  i,(j K)dx,  (9.36) 

which  may  be  obtained  using  various  approaches.  We  will  examine  one  such  approach 
now.  A  second  approach  will  be  presented  in  Part  IV  (§  14.6). 

Beforehand,  we  replace  z  with  Gt  in  formulas  (9.33)  and  (9.36),  where 


then,  we  will  obtain: 


v*; 


(9.37) 


(9.38) 


pnp„l_e-o./w.J  J  /  (9*39) 

« 

In  derivation  of  formula  (9.39),  we  use  the  aforementioned  assumption  /128 
whereby  error  probabilities  P„  and  Pmp  will  not  change  if  the  shape  of  response 

curve  In  in  the  Figure  9.3  circuit  is  replaced  by  any  other  monotonic 

function  M  and  the  threshold  value  simultaneously  is  adjusted  from  Uq  to  some 
other  magnitude  U  '.  Therefore,  instead  of  Figure  9.3,  we  may  examine  the  Figure 


Figure  9.5.  (a)  —  OF  [optimum  filter];  (b)  —  D  [detector]. 


9.5  circuit.  Here,  A  —  linear  amplitude  detector  with  a  single  transfer  constant, 
which  at  moment  T  clips  the  optimum  filter  OP  output  voltage  envelope  Um(t)  value, 
i.  e. 


Ui'-U^T). 


(9.40) 


If  U^'  Uq'  results,  the  response  is  "yes"  (signal);  otherwise  the  response 
is  "no"  (no  signal). 

Consequently,  false-alarm  probability  Pa  » is  the  probability  that,  at  moment 
T,  noise  envelope  Umm(T)  at  linear  filter  output  will  exceed  Uq'. 

Since  noise  at  filter  output  has  normal  distribution,  then,  in  view  of  filter 
linearity,  the  noise  retains  normal  distribution  at  filter  output  also  and  the 
envelope  of  this  noise  Umm( Ois  subordinate  to  Rayleigh's  law.  Therefore, 


P.r  =  P  li/mrn  ( T)  >(/,']« 


(9.41) 


where  I/m*  —  mean  square  of  noise  voltage  «*(0  at  filter  output. 


Miss  probability  equals 


Pnp-l-J’n 


(9.42) 


where  Pa  o  —  probability  of  correct  signal  detection  given  the  condition  that 
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there  is  a  signal.  Consequently,  Pa  o  is  the  probability  that  the  signal- plus-noise 
envelope  at  moment  T  will  exceed  threshold  U  ' 

P*o=P\Um«n(T)>U9'],  (9.43) 

where  U„t B —  value  of  the  envelope  (at  moment  T)  of  an  oscillation,  which  is 
the  sum  of  signal  sinusoidal  voltage  and  normal  noise  voltage. 

It  is  known  [21]  that  the  probability  density  of  the  envelopes  of  such  an 
oscillation  is  determined  by  expression 

p( 

c  III  \ 


where  U„e —  signal  amplitude  value.  Therefore, 


Designating  Vmcm!Um  -  V »  we  obtain 


f  ye- 


(9.44) 


Here,  UmJU at  is  the  ratio  of  signal  amplitude  to  the  mean  square  value  of  the 
noise  voltage  at  optimum  filter  output  at  moment  T,  i.  e.,  at  that  moment  when 
signal  voltage  is  maximum.  It  was  shown  in  §  2.3  that  this  ratio  equals 
V'  2Q  ,V,  [see  formula  (2.34)],  i.  e.,  in  thi8  case 


t'w  I  at,  *  (9.45) 
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Therefore,  formula  (9.44)  may  be  written  in  the  form 


U'!Um 


Considering  (9.42),  we  have 


(9.46) 


ee  __ m 


(lit- 


y.‘/vm 

It  follows  from  comparison  of  expressions  (9.38)  and  (9.41)  that 


(9.47) 


«V 


>a; 


(9.48) 


therefore,  expression  (9.47)  coincides  with  (9,39),  which  also  required  proof. 


Thus,  it  is  demonstrated  that  formulas  (9.38)  end  (9.39)  determine  falae-alarm 
and  mias  probabilities  during  detection  of  a  sinusoidal  signal  with  an  equally- 
probable  phase  (on  a  normal  white  noise  background).  Curves  providing  the  link 
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Figure  9.6 


Figure  9.7 


between  error  probabilities  Pn  Tand  Pnp  for  verioue  signal-to-noiee  Qq/No  values 
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may  be  computed  using  these  formulas  (Figure  9.6).  These  curves  were  computed 
for  the  first  time  in  work  done  by  Peterson  and  others  [17]. 

There  is  no  requirement  to  use  the  curves  for  slight  error  probabilities 
since  simple  analytical  expressions  may  be  obtained  for  these  cases. 

It  was  demonstrated  in  [106]  that,  given  P„  T<  0,1  and  Pnp<0,l  ,  from  formulas 
(9.38)  and  (9.39)  one  obtains  the  following  with  less  than  a  0.5  dB  loss 


Where  Pa?~ ♦  0  and  /**  T-»-0  ,  the  error  for  this  formula  asymptotically  will 
strive  towards  zero. 

It  follows  from  comparing  formula  (9.49)  with  corresponding  formula  (5.31a), 
obtained  for  a  precisely-known  signal,  that,  given/**  T<  0,1  and  P „P<0,1  ,  they 
essentially  coincide. 

Consequently,  given/**  T<  0,1  and  /*..»*<  0,1  ,  approximately  the  same  signal 
energy  is  required  to  detect  a  random-phase  signal  as  is  the  case  for  a  precisely- 
known  signal. 

This  conclusion  already  is  invalid  for  high  error  probabilities.  Thus,  for 
example,  relationships  of  Qg/Ng  to  QJN,  where  P„  T  s  0.1  are  depicted  in  Figure 
9.7  for  a  random-phase  (solid  curve)  and  a  precisely -known  (dotted  curve)  signal. 

It  is  evident  from  comparing  these  curves  that,  where  P*  T  -  0.1  and  PB9  =  0.5, 
the  energy  required  to  detect  a  random-phase  signal  is  greater  by  a  factor  of 
almost  2.5  than  is  the  case  for  a  precisely -known  signal  and,  given  a  further  /131 
decrease  in  magnitude,  this  difference  will  become  even  more  pronounced. 

We  now  will  examine  in  more  detail  the  problem  of  threshold  bias  selection 
at  detector  output.  Detector  output  and  threshold  bias  in  the  Figure  9.3  circuit 
are  characterized  by  dimensionless  magnitudes Ux  —  In  U^r^jand  Uq,  where  expression 
(9.24)  determines  UQ. 


In  the  corresponding  real  circuit,  there  is  a  requirement  to  separate  the 
matched  linear  filter  output  voltage  envelope  and  compare  it  at  moment  t  =  T  with 
some  real  threshold  Unp  (having  voltage  dimensionality).  Therefore,  if  one  assumes, 
as  was  the  case  when  examining  the  Figure  9.5  circuit,  that  a  linear  detector 
with  a  single  transfer  constant  separates  the  envelope,  then 


and,  in  accordance  with  formulas  (9.38)  and  (9.48),  we  obtain 
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where  Um  —  extant  envelope  noise  voltage  value  at  detector  output,  i.  e.,  at 
optimum  linear  filter  output. 


Since 
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one  also  may  assume  that 
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The  aforementioned  analysis  was  performed  for  a  signal  changing  with  respect 
to  law  (9.2),  i.  e.,  for  an  unmodulated  sinusoidal  signal.  If  the  signal  has 
a  more  common  form,  one  modulated  with  respect  to  amplitude  and  with  respect  to 
phase  (frequency),  i.  e., 


ur  {( )  o  (/>  c«>s  !«•*/  i  i(>(0  I  i|), 


(9.50) 


where  a(t)  and  ’i>(t)  -- -  precisely -known  time  functions,  then  it  is  possible  to 
show  [17]  that  all  aforementioned  results  will  remain  valid,  with  the  following 
provisos: 


a)  signal  energy  Qq  should  be  replaced  by  Q,  where 


Q  =  tJ 

U 


b)  the  inverse  probablity  of  signal  presence  equals  Py(C),  rather  than 
Py(aQ)  [formula  (9. 20)],  where  /132 


Af, 

r 

*1  =  $  y  (0  a  {/)  cos  (a>/  + 1|)  (/)|  dt\ 

b 

T 

J'i  =  5  y(/)a(Osinlti>/  +  i|>(/))d/, 


(9.51) 


where  P(C)  —  a  priori  probability  of  signal  presence. 

It  follows  from  these  relationships  that  parameter  1^,  just  as  was  the  case 
for  the  parameter  M  mentioned  previously,  may  be  computed  with  the  aid  of  an  optimum 
filter  and  amplitude  detector  or  by  the  Figure  9.1  computing  device.  In  the  latter 
case,  cos  tot  and  sin  «>/  should  be  replaced  everywhere  they  appear  by 
a(t)  coa  [  w/  +  ♦(/)]  and  a(t)  sin  [  tot  +  ♦(/)],  respectively. 

Optimum  filter  structure  also  is  complicated  accordingly. 

9.3  Binary  Detection  of  a  Fluctuating  Signal 

Let  the  signal  have  the  form 


u„  (/)  =  a  cos  (w/  -f  <r). 


(9.52) 


where  frequency  w  is  precisely  known,  while  amplitude  a  and  initial  phase  ‘P 
during  observation  cycle  T  are  unchanged,  but  fluctuate  from  one  observation 
(observation  cycle)  to  another,  i.  e.,  they  change  as  analog  random  magnitudes 
with  some  distributions  W(a)  and  P  (<p)  ,  respectively.  For  brevity,  this  is  called 
a  fluctuating  signal. 

We  will  assume  that  distribution  P(  <P  )  is  uiiform,  i.  e.,  it  is  described 
by  relationships  (9.3). 

Let  the  a  priori  probabilities  of  the  presence  and  absence  of  such  a  signal 
equal  P(C)  and  P(0),  respectively.  If  there  is  a  signal  at  input,  then  its  amplitude 
has  some  non-zero  value  a.  If  there  is  no  signal  at  input,  then  this  denotes 
that  its  amplitude  is  identical  with  zero  (a=sQ).  Therefore,  solution  of  the 
signal  detection  problem  means  answering  the  question  of  whether  or  not  amplitude 
a  has  some  non-zero  value  (it  is  unimportant  exactly  what  the  value  is)  or  if 
it  is  identical  with  zero. 

In  accordance  with  the  inverse  probability  approach,  the  answer  to  this  question 
requires  comparison  between  themselves  of  inverse  probabilities  Py(C)  and  Py(Q), 
where  Py(C)  —  inverse  probability  that  amplitude  a  has  some  non-zero  value  (it 
is  unimportant  exactly  what  value),  while  Py (0)  —  inverse  probability  that  amplitude 
a  is  identical  with  zero. 

It  follows  from  formula  (9.15)  that,  for  a  random-phase  equiprobable  /133 
signal,  amplitude  a  inverse  probability  density  Py(a)  equals 

P„(a)  =  P (a)e-“'mw’  1,^-^-)  .  (9.15) 

Since  Py(C)  is  the  inverse  probability  that  amplitude  a  has  any  non-zero 
value,  then 
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(9.53) 
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i.  e 


P(«)e— da,  (9.54) 

where  P(a)  —  a  priori  non-zero  amplitude  a  probability  density.  Therefore,  in 
this  case 

P(a)~P(C)W{a)  (9.55) 


and 


e» 

P,(C)~lhP(Ql  «f(e)c— r,,JV‘U(~)<to. 


(9.56) 


It  fallows  from  (9.15)  that 


Py(0)~ktP(0). 


(9.57) 


Coefficient  k2  included  in  formulas  (9.56)  and  (9.57)  is  determined  from 
a  normality  condition  which,  for  the  case  being  examined,  has  the  form 


P,(Q+P,  (0)=*  I. 


(9.58) 


For  simplicity,  in  future  we  will  proceed  from  the  maximum  inverse  probability 
criterion,  i.  e.,  assume  that  there  is  a  signal,  if 


P,(Q>P,(0).  (9.59) 

and  there  is  no  signal  if 

P,(C)<P»( 0)-  ^•6°) 

In  addition,  for  specificity  we  will  assume  that  signal  amplitude  distribution 
W(a)  is  subordinate  to  Rayleigh's  law,  i.  a., 
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W(a)  = 

«c* 


(9.61) 


Here,  expression  (9.56)  takes  the  form 


Py(Q  =k,P(C)4r\  «e— 1*»  1.(^2)  da, 
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where 


J— L+I— L/h-?2V 

*  5?  5?  v  r  w.  r 


QrP=J?T 


is  average  signal  specific  energy. 


(9.62) 


(9.63) 


(9.64) 


The  integral  in  expression  (9.62)  is  tabular  and  has  the  following  value: 
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Substituting  it  into  (9.62),  we  obtain 


PAQ^kyPiQ 

u? 


(9.65) 


It  follows  from  relationships  (9.57),  (9.59),  and  (9.65)  that  a  detector 
must  supply  the  response  "yes"  (signal  at  input)  when  the  following  inequality 
is  satisfied 


M  > 


(9.66) 


where 


(9.67) 


If  inequality  (9.66)  is  not  satisfied,  then  the  decision  is  that  there  is 
no  signal.  Consequently,  the  task  of  the  optimum  fluctuating  signal  detector 
boils  down  to  parameter  M  computation  and  its  comparison  with  threshold  . 


It  was  shown  in  the  preceding  section  that  magnitude  M  is  proportional  to 
the  value  (at  moment  T)  of  the  envelope  of  the  oscillation  at  optimum  linear  filter 
output.  Therefore,  as  was  true  for  a  signal  of  known  amplitude,  the  optimum  fluc¬ 
tuating  signal  detector  has  the  form  depicted  in  Figure  9.5.  All  this  circuit's 
parameters  also  will  remain  unchanged,  with  the  exception  of  threshold  bias  magnitude. 

This  optimum  receiver  coincidence  (except  for  threshold  bias)  for  cases  of 
fluctuating  and  non-fluctuating  signal  amplitudes  is  explained  by  the  fact  that 
the  receiver's  optimum  filter  OP  is  a  linear  system  whose  parameters  are  independent 
of  signal  amplitude;  the  shape  of  the  envelope  detector  response  curve,  as  noted 
above,  also  plays  no  role,  if  the  threshold  magnitude  is  adjusted  accordingly  /135 
when  it  changes. 

Hence,  it  follows  that  the  transition  from  optimum  detection  of  a  signal 
with  precisely-known  amplitude  to  optimum  detection  of  a  random-amplitude  signal 
subordinate  to  any  distribution  law  W(a)  may  be  accomplished  without  any  detector 
(receiver)  circuitry  changes,  except  for  the  corresponding  adjustment  of  threshold 

bias  U  '  at  output. 

0-. 

For  a  Rayleigh  law  of  amplitude  distr.  [formula  (9.61)],  the  threshold 

bias  magnitude,  reduced  to  function  p(«F)envej~r  -  .  equals  and  is  determined 

from  formula  (9.67).  However,  in  a  real  circuit  (Figure  9.5),  envelope  detector 
output  voltage  is  compared  at  moment  t  =  T  with  threshold  UQ’ .  If  it  is  assumed 
that  this  detector's  transfer  constant  (for  the  envelope)  equals  unity,  then  optimum 
linear  filter  output  voltage  equalling 
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is  compared  with  threshold  UQ'  at  moment  t  =  T. 

The  result  in  the  absence  of  noise  is 
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(9.69) 


It  follows  from  comparison  of  expressions  (9.16)  and  (9.68)  that  voltage 
u.H.  (7)  exceeds  magnitude  f%j  by  a  factor  of  .  Therefore,  real  threshold  Uq' 
must  exceed  threshold  u„^  determined  by  formula  (9.67)  also  by  factor  c*a  , 
i.  e.  | 


*«<««*• 


(9.70) 


Considering  relationships  (9.63),  (9.64),  (9.67),  (9.69)  and  the  fact  that 
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it  is  possible  to  reduce  formula  (9.70)  to  the  following  form: 


v*  -  t'T U.  |/(  I  +  .1  I  l„|£ffl.  (I  +  »>]  (9.71) 


(where  ~55;(1+4)<1  »  one  should  assume  *  0). 

r  (C) 
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where 


(9.72) 


Now,  we  will  find  fluctuating  signal  error  probabilities  P*  »»  and  /136 
/>0(n  ,  assuming  a  Rayleigh  signal  amplitude  distribution,  while  initial  phase 
is  uniform.  Here,  instantaneous  signal  voltage  changes  have  normal  distribution 
with  zero  expected  value  and  dispersion  (at  matched  filter  output)  equalling 


I  S?  . 


Since  noise  at  matched  filter  output  has  normal  distribution  with  zero  expected 
value  and  dispersion  Ua  ,  then  signa 1- plus-noise  will  have  normal  distribution 
with  dispersion 
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Ulm  =.Mc  am  +  l/i  •  (9.73) 

At  moment  t  =  T,  filter  output  voltage  envelope  H»ui(7)is  compared  with  threshold 
t/,p  (it  is  possible,  without  sacrificing  conformity  of  results,  to  assume  that 
the  detector  transfer  constant  equals  unity).  Therefore,  error  probabilities 
P*  »  and  Pnp  completely  are  determined  by  the  voltage  u,uw(T)  law  of  distribution 
(uniform)  and  by  the  threshold  Utp  magnitude. 

It  follows  from  what  has  been  stated  that  the  law  of  distribution  of  instan¬ 
taneous  filter  output  voltage  changes  will  remain  normal,  both  when  there  is  and 
when  is  not  a  signal.  The  only  thing  that  changes  is  the  dispersion  of  this  voltage 
from  magnitude  U*m  to  —  (/£  4-  .  Therefore,  the  envelope  uBUt(T)  law 

of  distribution  must  be  a  Rayleigh  distribution  when  a  signal  is  absent  as  well 
as  when  one  is  present.  Consequently, 
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Averaging  (with  respect  to  realizations)  the  left  and  right  sides  of  this  equality 
we  have 

<n  Qm 

v,  /v, 

Substituting  this  expression  into  (9.77)  and  considering  relationship  (9.72), 
we  obtain 


(9.78) 


Consequently,  formula  (9.78)  may  be  written  in  the  form 


(9.79) 


Using  formulas  (9.71,  (9.75),  and  (9,79),  it  is  not  difficult  to  compute 
error  probabilities  P«T  and  Pn p  for  given  values  q  and  P(0)/P(C).  Composite 
error  probability  ia  determined  by  formula 


Pom^P(0)Pnr  +  P(q  Pm. 


(9.80) 


For  example,  we  will  examine  a  case  where  P(C)  s  P(0).  Here,  considering 
the  normality  condition,  we  obtain 


P(Q-P(0)-0,5, 


(9.81) 


and  the  aforementioned  formulas  take  the  following  form: 


where 


P«  r=e 


-v 


np 


"  1 


l*=(\  +  \lq)\n(\+q). 
PonMP^  +  PnvW- 


The  Figure  9.8  curve  is  plotted  from  these  formulas. 


(9.82) 


Figure  9.8 


Given  permissible  probability  Pom magnitude,  it  is  possible  to  determine 
requisite  signal-to-noise  ratio  q.  If  a  priori  probabilities  P(-0)  and  P(C)  are 
unknown,  composite  error  probability  P*m  also  may  not  be  found  and  it  is  necessary 
to  be  given  conditional  error  probabilities  Pn  »  and  PBp  ,  rather  than  /138 
magnitude  Pon  .  Here,  as  follows  from  (9.75)  and  (9.79),  the  requisite  signal- 
to-noise  ratio  equals 
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I  Where 
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^np  ^0.1, 


it  is  possible  with  sufficient  precision  to  assume  that 
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Requisite  threshold  Uop  magnitude  is  determined  from  relationship  (9.75) 
and  equals 


Ue  will  compere  (9.84)  with  expression  (9.49)  obtained  in  §  9.2  for  a  signal 
with  a  precisely -known  amplitude.  In  the  case  of  a  signal  with  precisely-known 
amplitude,  requisite  energy  Qq  will  depend  on  error  probabilities  Pn  »  and 
P op  in  approximately  an  identical  manner  (where  P*  ,<  0,1  and  Pnp  <  0,1 ),  and 
will  do  so  logarithmetically.  For  a  fluctuating  signal,  the  dependence  of  average 
signal  energy  QCp  on  probabilities  P**  and  P op  is  considerably~clifferent: 
dependence  of  Qcp  on  P„  T  is  logarithmetic,  while  it  is  hyperbolic  on  ^np ,  i.  e., 

I 

significantly  more  defined.  Therefore,  for  a  fluctuating  signal  of  the  type  being 
examined  (i.  e.,  for  amplitude  fluctuations  with  respect  to  Rayleigh's  law),  it 
.is  significantly  more  difficult  to  insure  a  slight  miss  probability  than  a  slight 
false-alarm  probability.  It  also  follows  from  this  that,  given  a  slight  permissible 
miss  probability,  detection  of  a  fluctuating  signal  requires  that  the  signal  have 
considerably  more  energy  than  is  the  case  for  detection  of  a  signal  with  known 
amplitude. 

Analysis  of  the  impact  of  the  ambiguity  (fluctuation)  of  signal  rf  occupation 
phase  and  its  amplitude  an  required  signal  energy  performed  in  the  preceding  sections 
demonstrates  that,  given  slight  permissible  detection  error  probabilities,  phase 
ambiguity  does  not  play  a  significant  role,  while  amplitude  ambiguity  may  require 
a  very  significant  increase  in  its  average  energy. 

9.4  Impact  of  Receiver  Frequency  Characteristic  and  Bandwidth  Shape  on 
Detection  Quality  /139 

It  follows  from  the  Figure  9.5  circuit  that  the  basic  optimum  detector  element 
is  an  optimum  linear  filter  matched  with  the  anticipated  signal.  This  signifies 
that,  for  noise  i iw(i)  in  the  form  of  white  noise,  filter  transfer  constant  K(jfo) 
must  satisfy  relationship  (2.30),  i.  e.,  be  complex  conjugate  with  anticipated 
signal  spectrum  <S(/m)  .  However,  it  was  noted  in  §  2.3  that,  during  detection 
of  a  sinusoidal  radio  pulse  (like  the  one  depicted  in  Figure  2.7  or  2.19),  there 


IF*-®*  through  replacement  of  the  optimum  linear  filter  by  a  standard 

amplifier  with  bamfcidth  matched  with  the  width  of  the  signal  spectrum 


Ware  detailed  examination  of  this  problem  presented  in  [5]  for  a  signal  with 
known  amplitude  and  random  initial;  phase  made  it  pdsa&iie'.te plot  'the  Figure  9,9 
curves*  In  this  figure,  &f*u  —  product  of  amplifier  bandwidth  Af  and  signal 
radio  pulse  deration  T»  (with  sinusoidal  rf  occupation).  It  is  assumed  here  . 
that  time  T,  devoted  to  detection,  coincides  with  the  pulse  duration  (I  =  )j 
n.  —  signal  energy  utilization  factor,  i.  e, 


where  g  shd  g  * 


ere  a  given  detection  quality 
filter  and  standard  bend-pass 


detection  error 


Figure  9.10 


amplifier,  respectively,  ate  used*  It  ia  evident  that  magnitude  1 /V  equals 
requisite  eignei  energy  toeswhen  a  band-pass  amplifier  replaces  an  optimum  filter. 
Curve!  correapdnda 't#'  a-. '  dibit  ihesb’'  radii'  e(t)  and'  band- pa  as  amplify 

frequency  cfmractiristie  f(f)  aredeseribed  by  gaueeiaivcurvee*  curve  2  will 
relate  to  a  oiarefidihtictl  frequency  characteristic  K(f),  but  for  a  pulse  with 
e .  roctangjlir-  'anveloim.  Curve  >  ia  valid  for  the  case  of  rsctanculer  a(t)  and 


It  is  evident  from  the  figure  that,  given  appropriate  filter  bandwidth  Af 
selection,  the  loss  in  requisite  energy  either  is  absent  altogether  (curve  1) 
or  is  very  slight  (curves  2  and  3),  i.  e.,  does  not  exceed  1  dS.  In  a  case  where 
there  are  slight  band  deviations  from  optimum  value  *VniiT(jn  particular,  from 
value  1.37/t„  for  curve  3),  such  as  for  changes  not  greater  than  a  factor  of 
1.5,  the  loss  rises  slightly.  For  great  deviations  (for  instance,  for  band  broadening 
or  narrowing  by  a  factor  of  5  —  10  and  more),  the  loss  is  great. 


However,  when  several  additional  conditions  are  met,  a  change  in  filter  bandwidth 
by  a  factor  of  5  —  10  or  more  may  tum  out  to  be  permissible.  We  initially  will 
examine  these  conditions  relative  to  a  case  of  broadening  band  Af  compared  /140 
to  its  optimum  value  A /„„T.  In  this  case,  it  suffices  to  connect  an  af  filter 
with  bandwidth  AF  matched  with  signal  pulse  duration  at  envelope  detector  output 
(Figure  9.5),  i.  e.,  bandwidth 


i 


(9.86) 


and,  at  moment  t  a  T,  compare  it  with  this  filter's  output  voltage  threshold  [it 
is  evident  here  that  the  assumption  is  that  the  moment  of  anticipated  appearance 
(or  non-appearance)  of  a  pulse  at  detector  input  and  pulse  duration  t„  s  T  are 
known  beforehand  at  the  point  of  reception]. 

We  will  examine  a  case  where  sinusoidal  signal  amplitude  a  is  constant  and 
known  within  observation  cycle  limits  0  —  T,  while  initial  phase  is  constant  and 
equally  probable  in  the  interval  0  —  27  We  will  assume  that  Af  >  1/ t„  and, 
consequently,  Af  >  AF.  In  this  case,  it  is  possible  approximately  to  consider 
that  af  filter  output  voltage  has  a  normal  law  of  distribution,  both  when  a  signal 
is  and  is  not  present.  Therefore,  it  is  possible  to  assume  that 


(9.87) 
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where  U^m  and  Us,  —  noise  constant  component  and  dispersion  at  detector  output 
when  there  is  no  signal,  while  U-cw  and  Uch  —  analogous  parameters  when  there 
is  a  signal  (i.  e. ,  signal-plus-noise). 

Dispersions  Ul,  and  Ueh  are  determined  within  af  filter  bandwidth  limits. 

This  filter's  transfer  constant  at  the  zero  frequency  is  assumed  to  be  unity. 

Here,  since  condition  (9.86)  is  met,  it  is  possible  approximately  to  assume  /141 
that,  by  moment  t  =  T  of  comparison  with  the  threshold,  average  filter  output 
voltage  values  and  the  dispersions  of  these  voltages  essentially  achieve  steady-state 
values. 

Considering  the  aforementioned  assumptions  and  also  assuming  P„p  ^0,1  and 
Pa  T  <  0,1,  it  is  possible  based  on  (9.87)  to  obtain  the  following  expression  valid 
both  for  a  square  and  for  a  linear  envelope  detector: 


Comparing  it  with  expression  (9.49)  for  an  optimum  linear  filter,  we  find  out 
that 

—  =51= _ YiSjf _ 

”  9  ]/ -jr~  +  \f  ln-~  (9.88) 

The  result  obtained  in  [127]  for  a  fluctuating  signal  and  P„  T  pnp  coincides 
with  formula  (9.88)  where  AfT  '*>•  1  (if  in  it  one  assumes  that  Pn  T  P np),  while, 
for  AfT  >-  1  (andP;,  T  /'<*)),  it  is  depicted  by  the  Figure  9.10  curves.  The 
curves  demonstrate  that  formula  (9.88)  is  sufficiently  precise  already  where  AfT 
50  ~  100  (here,  as  was  the  case  above,  we  assume  that  T  =  rn  ).  it  also  follows 
from  this  that,  even  when  band  Af  is  broadened  in  comparison  with  ..VOIIT-s  l  i„ 
by  a  factor  of  5  —  10,  the  loss  in  requisite  signal  energy  does  not  exceed  2  — 

3  d8,  i.  e.,  is  considerably  less  than  when  there  is  no  af  filter  at  detector 
output  (compare  the  Figure  9.9  and  9.10  curves).  Consequently,  connection  at 
detector  output  of  an  af  filter  with  a  band  determined  by  relationship  (9.86) 
makes  the  detector  less  critical  to  broadening  of  the  bandwidth  of  the  filter 
preceding  the  detector. 


We  now  will  explain  under  what  conditiona  it  ia  possible  to  insure  slight 
detector  criticality  to  a  significant  narrowing  of  the  bandwidth  of  the  filter 
preceding  the  detector  (compared  with  A/„nT  *  1  /t„)  .  As  was  the  case  previously, 
we  will  assume  that  the  signal  has  the  form  of  a  sinusoidal  pulse  with  a  rectangular 
envelope  and  the  moment  of  anticipated  appearance  (or  non-appearance)  of  the  signal 
at  input  is  known.  Also  known  is  signal  pulse  duration  t„  ,  equalling  time  T 
devoted  to  signal  detection. 

Under  these  conditions  gating  of  filter  00  preceding  the  detector  also  /142 
is  possible  (Figure  9.5),  i.  e.,  connecting  it  only  at  time  interval  OtT  of 
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Figure  9.11.  (a)  —  OF’  [optimum  filter];  (b)  —  Decision. 

anticipated  signal  arrival.  It  is  possible  to  consider  the  gating  operation  to 
be  connection  ahead  of  the  filter  (or  beyond  it)  of  a  stage  multiplying  duration 
T  by  rectangular  video  pulse  u°n(0  ,  as  shown  in  Figure  9.11. 

Realized  here  in  the  optimum  detector,  in  essence,  is  the  optimum  correla¬ 
tion-filtration  device  described  in  §  4.3  (see  Figure  4.4).  Gating  voltage  cor¬ 
responds  to  function  f^(t),  while  filter  00'  must  be  matched  with  signal 

u'x(t)  =  f2(t), 

where  f2(t)  —  any  function  meeting  condition  (4.19),  i.  e. ,  such  that,  following 
multiplication  by  f^(t),  it  provides  an  oscillation  coinciding  with  anticipated 
signal  ux(t). 

It  is  clear  from  Figure  9.12  that  it  ia  possible  to  accept  as  function  ux’(t) 
not  only  actual  anticipated  signal  u  (t),  but  also  any  other  signal  of  greater 
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Figure  9.12 


(but  finite)  duration  which  coincides  with  the  anticipated  signal  only  in  sector 
0  —  T  (the  signal  depicted  in  Figure  9.12c  and  9.12d,  for  instance). 

This  denotes  that,  given  the  gating  stage  in  the  Figure  9.11  circuit,  it 
is  possible  to  select  filter  00' matched  with  a  signal  of  greater  (than  tb  )  duration 
and  having  any  envelope  shape  outside  the  range  0  —  T.  But,  optimum  filter  bandwidth 
narrows  when  signal  duration  increases,  while  the  shape  of  this  filter's  frequency 
characteristic  also  changes  with  a  change  in  envelope  shape.  Therefore,  it  is 
evident  that,  when  gating  is  used,  the  impact  of  filter  bandwidth  narrowing  (compared 
with  A/„„t  «  I/t„  )  and  a  change  in  frequency  characteristic  shape  on  detection 
quality  radically  decreases. 


In  addition,  it  follows  from  this  that,  if  a  quasi-optimum  standard  resonant 
amplifier  with  band  Af,  for  example,  replaces  the  optimum  filter,  the  filter  bandwidth 
narrowing  compared  to  1/t,  also  will  have  little  impact  on  detection  quality. 


This  result  may  be  explained  in  the  following  manner.  When  gating  is  /143 
used,  there  is  a  rise  in  both  signal  amplitude  and  noise  dispersion  in  time  interval 
0  —  T  at  filter  output.  If  Af  <  1/t„  ,  then,  by  moment  t  =  T,  both  signal  voltage 

magnitude  and  noise  dispersion  magnitude  turn  out  to  be  slight  (compared  to  the 
steady-state  value);  therefore,  there  is  no  deterioration  in  the  signal-to-noise 
ratio  (in  a  real  circuit,  it  is  inacfc/isible  all  the  same  to  select  band  Af  that 
is  too  slight  since,  here,  the  filter  transfer  constant  will  decrease  radically 
and  additional  amplification  must  be  used  to  compensate  for  this  effect). 

It  follows  from  what  has  been  stated  that,  in  several  cases,  it  is  possible 
to  obtain  a  detector  slightly  critical  to  significant  (by  several  factors)  changes 
in  the  bandwidth  of  the  filter  preceding  the  detector  and  to  changes  in  the  shape 
of  this  filter's  frequency  characteristic. 

9.5  Binary  Phase  Detection 

It  was  assumed  during  examination  of  binary  detection  in  Chapter  5  and  in 
the  preceding  sections  of  this  chapter  that  a  detector  (receiver)  optimally  considers 
all  usable  information  contained  in  signal-plus-noise. 

It  is  convenient  for  relatively  narrow-band  signals  to  represent  signal-plus- 
noise  in  the  form  of  a  single  oscillation  with  some  resultant  amplitude  UP(t)  and 
resultant  initial  phase 

y  (/)  -  l/p (0  COS  [to/  +0(01.  (9.89) 

In  the  general  case,  information  on  the  usable  signal  will  be  contained  both 
in  envelope  &M0and  in  phase  ,0(0  of  resultant  oscillation  y(t).  However, 
in  some  cases,  usable  information  may  be  contained  wholly  (or  almost  wholly)  only 
in  envelope  Uv{i\  or  only  in  phase  0(0* 

Actually,  the  above  comparison  of  binary  detection  of  precisely-known  signals 
and  of  random  initial  phase  signals  demonstrated  that,  given  slight  signal-to- 
noise  ratio,  the  uncertainty  of  the  initial  phase  (or  failure  to  take  account 
of  this  phase  in  the  detector)  will  lead  to  a  large  loss  in  requisite  signal  energy. 


This  loss  decresses  monotonicslly  by  virture  of  increasing  the  signal-to-noise 
ratio  and,  given  sufficiently-strong  signals,  essentially  equals  zero. 

This  denotes  that,  in  the  case  of  a  precisely -known  signal,  usable  information 
will  be  contained  both  in  envelope  U^iO  and  in  phase  0(()  of  the  resultant  oscil¬ 
lation.  But,  given  a  slight  signal-to-noise  ratio,  this  information  is  concentrated 
mainly  in  phase  !  0(f)  (since,  in  this  case,  failure  to  account  for  0(0,  occurring 
in  the  amplitude  detector,  will  lead  to  a  sharp  decrease  in  detection  reliability), 
while  it  is  concentrated  mainly  in  envelope  Up(t) ,  when  the  signal-to-noise  ratio 
is  high. 

Consequently,  given  slight  slight  signal-to-noise  ratios,  phase  detectors, 
i.  e. ,  devices  reacting  only  to  phase  .  0(0  of  the  resultant  oscillation  and 
not  reacting  to  its  amplitude,  may  provide  just  as  good  results  as  an  optimum  /144 
detector,  which  analyzes  entire  oscillation  y(t).  Therefore,  along  with  the  afore¬ 
mentioned  detectors  based  an  analysis  of  entire  oscillation  y(t)  or  its  envelope 
l/p(0,  phase  detectors  are  of  great  interest. 

Interest  in  phase  detectors  especially  rose  when  the  following  difficulty, 
which  arises  during  practical  realization  of  detectors  which  analyze  instantaneous 
values  y(t)  and  amplitude  of  oscillation  ^p(0,  was  encountered. 

Threshold  bias  U0 p  is  set  at  the  output  of  such  detectors.  If  a  priori 

probabilities  P(0)  and  P(C)  are  known,  then  requisite  optimum  magnitude  Oop  will 
depend  both  on  signal-plus-noise  input  intensities,  and  on  receiver  amplification 
[see  formulas  (5.15)  and  (9.71),  for  example].  If  probabilities  P(0)  and  P(C) 
are  unknown,  then  the  threshold  magnitude  is  established  based  only  on  the  permissible 
false-alarm  probability  [see  formulas  (5.33b)  and  (9.85),  for  instance].  Here, 
requisite  magnitude  Upp  will  not  depend  on  signal  intensity,  but  dependence  on 
input  noise  intensity  and  receiver  amplification  i8  retained.  Therefore,  in  all 
cases,  a  change  in  receiver  amplification  (as  a  result  of  instability)  requires 
corresponding  adjustment  in  threshold  Uop  magnitude. 

There  is  no  such  difficulty  in  phase  detectors  since  phase  0(0  of  the 
resultant  oscillation  will  not  depend  on  receiver  amplification.  In  addition, 
in  recent  years  radiotechnical  phase  systems,  which  include  an  amplitude  detector, 
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have  become  widespread  (see  [177],  for  example).  Here,  it  is  important  to  know 
whether  or  not  it  is  permissible  to  use  limiter  output  voltage  both  for  signal 
detection  and  for  signal  parameter  measurement.  For  these  reasons,  a  series  of 
works  [16,  124,  131,  177,  181,  and  others]  has  been  devoted  in  recent  years  to 
examination  of  phase  detection  methods.  We  will  dwell  briefly  on  the  most  important 
results  obtained  in  them. 


Initially,  we  will  examine  optimum  phase  detection,  in  which  information 
included  in  phase  9  (t)  is  used  in  the  best  theoretically -possible  manner.  This 
denotes  that  the  signal/no  signal  decision  will  be  based  upon  comparison  of  inverse 
probabilities  P$(Q  and  P$(0)  of  signal  presence  and  absence  for  given  realization 
9 (t)  of  input  signal-plus-noise  phase. 


We  will  assume  that  signal  ue(/)  is  precisely  known,  i.  e., 


iie(/)  -  a  (/)  cos  forf +«p(/)|, 


where  parameters  a(t),  T.(t) ,  and  »  are  precisely  known.  Then 


(9.90) 


P,(O-*P(qPe{0), 

P9(0)-*P(0)P,(0). 


(9.91) 


where  P(C)  and  P(0)  —  signal/no  signal  a  priori  probabilities,  while  PQ(  9  ) 
and  PQ(  6  )  —  multidimensional  phase  d  distributions  for  signal  presence  and 
absence,  respectively. 

A  "yes"  decision  (signal)  results  in  the  optimum  detector  if  /145 


P9(C)>f\/>9(0), 


(9.92) 


and  a  "no"  decision  (no  signal)  results  in  the  opposite  case.  Here,  q  —  weight 
factor  considering  the  relative  danger  of  signal  false  alarms  and  misses.  If 
both  types  of  errors  are  equally  dangerous  (i.  e.,  there  is  a  requirement  to  provide 
minimum  composite  error  detection  probability  P os),  then 
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(9.93) 


It  follows  from  (9.91)  and  (9.92)  that  a  “yes"  decision  is  required  if 


(9.94a 


where 


(9.94b 


and  a  "no"  decision  is  required  in  the  opposite  case. 


As  was  demonstrated  in  §  1.3,  any  time  f met  ion  f(t)  with  a  relatively  narrow 
band  spectrum  with  width  Af  completely  is  characterized  by  n  values  (f^,  ...» 
fR)  of  this  function  or  n^  values  of  its  envelope  and  n^  values  of  phase  [see 
(1.13)  and  (1.14),  for  example],  where 


(9.95) 


Here  T  —  observation  cycle  duration. 

In  our  case,  signal-plus-noise  y(t)  plays  the  role  of  f(t),  while  its  initial 
phase  9(t)  plays  the  role  of  phase.  Consequently,  function  0  (t)  of  interest 
to  us  completely  is  characterized  by  its  selected  n^  values 


(8i»  9»*  0»,). 


where  n.  is  determined  from  formula  (9.95). 
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Therefore, 


Pc  (0)  _  Pc(0i»  0»|) 

Pci 0)  ~  />.(», . 0«,)  *  (9.96) 

It  is  assumed  further  for  simplicity  that  the  values  selected  (  0  ^,  .  .  ., 

0nj)  statistically  are  independent.  Then,  from  (9.94a)  and  (9.96)  we  obtain 
the  following  decision  '•yes'*  rule: 


In 


ri  «*e  (®»> 
<- 1 


n 

/>  i 


>c. 


(9.97) 


where  trc(0 ,)  and  —  unidimensional  probability  densities  of  signal-  /146 

plus-noise  initial  phase  Q  and  of  noise  alone,  respectively,  which  are  computed 
relatively  simply  (see  [177],  for  instance).  However,  in  spite  of  this,  resultant 
expression  (9.97)  is  unwieldly.  Therefore,  cases  of  a  slight  »*  *)  and 

a  high  WUm  ^  >  1)  signal-to-noise  amplitude  ratio  at  input  usually  are  examined 
separately . 

As  already  noted  above,  for  a  high  signal-to-noise  ratio,  optimum  phase 
detection  must  supply  a  large  loss  compared  with  completely -optimum  detection 
[i.  e,  with  detection  based  on  processing  of  entire  input  realization  y(t),  rather 
than  only  its  phase  9  (t)].  Mathematical  calculations  actually  confirm  this 
fact  [131,  177,  and  others].  Therefore,  we  will  dwell  only  on  the  case  of  slight 
signal- to-noiae  ratio  a /Um%t  ,  for  which  optimum  phase  detection  approximates 
the  fully  optimum. 

At  first  glance,  it  would  appear  that  a  case  of  slight  signal-to-noise  ratio 
a!Um  n  is  not  of  interest  since,  here,  it  is  impossible  to  insure  high  detection 
validity.  However,  in  actuality,  this  is  not  the  case,  for  detection  error  proba¬ 
bilities  are  determined  by  signal-to-noise  power  ratio  Q/Nq  rather  than  amplitude 
ratio  a/Umn  .  Therefore,  if  time  T  devoted  to  detection  is  sufficiently  great, 
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then,  even  given  Blight  ratio  a/Um%it  it  is  possible  to  obtain  high  power  ratio 
Q/Nq  and,  consequently,  slight  detection  error  probabilities. 

Hence,  it  follows  that,  when  phase  detection  is  used,  good  results  only  may 
be  obtained  when  the  following  two  conditions  are  met  simultaneously: 


«=«'■ 


(9.98) 


(9.99) 


where 


(9.100) 


—  extant  value  of  the  noise  voltage  contained  in  input  signal- plus-noise  y(t). 
Since  Q  =  a^T/2,  then  conditions  (9.98)  and  (9.99)  may  be  met  simultaneously  only 
where 


(9.101) 


i.  e.,  in  cases  where,  during  observation  cycle  T,  one  succeeds  in  obtaining  a 
sufficiently-large  number  n^  of  independent  selective  values.  When  conditions 
(9.98)  and  (9.101)  are  met,  enalyais  of  decision  rule  (9.97)  provides  the  following 
results  (see  [131]  or  [177],  for  wampie). 


A  signal  preaent  decision  must  be  made  if  this  inequality  is  satisfied 


j  a  (!)  co*  |0  (/)— f  (/»«//  >l\ 


(9.102) 


I 


'-*■ 


(9.103) 


If  inequality  (9.102)  is  not  satisfied,  a  no  signal  decision  results. 


Hence,  it  follows  that,  under  the  aforementioned  conditions  Ut'V„ ,  «  I. 

A fT  >  1)  ,  an  optimum  phase  detector  must  comprise  an  ideal  phase  detector*,  which 
transforms  the  (9.89)  input  mixture  into  cos  [  6  (t)  —  <t>  ( t )  3 ,  a  correlator, 

which  computes  the  cross  correlation  of  the  phase  detector  output  voltage  and 
signal  envelope  a(t),  and  threshold  bias  UQ  at  output.  Here,  it  turns  out  that 
error  probabilities  Pn  T  and  P„ p  are  determined  by  formulas  differing  from  corres¬ 
ponding  formulas  (5.22)  and  (5.27b)  for  a  completely -optimum  detector  only  in 
that  they  will  contain,  in  place  of  signal  energy  Q,  magnitude 


Q- 


(9.104) 


This  signifies  that,  when  conditions  (9.98)  and  (9.101)  are  met,  an  optimum  phase 
detector  provides  identical  detection  error  probabilities  that  a  complet ely-opt imue 
detector  supplies  when  input  signal  energy  is  increased  by  a  factor  of  4/n  . 

In  other  words,  replacement  of  a  complet ely-opt imum  detector  by  an  optimum 
phase  detector,  given  the  aforementioned  conditions,  will  lead  to  only  a  1  d) 
loss  in  requisite  signal  energy  magnitude.  The  structure  of  an  optimum  phase 
detector  determined  from  algorithm  (9.102)  turns  out  to  be  more  complex  than  that 
of  a  completely -opt imum  detector  of  the  same  signal  (i.  e.,  of  a  precisely-known 
signal).  There  is  no  difficulty  in  becoming  convinced  of  this  if  you  compare 
expressions  (5.110  with  (9.46b)  and  (9.102). 

However,  it  is  possible  to  demonstrate  (see  [177],  for  example)  that,  when 
conditions  (9.98)  and  (9.101)  are  met,  the  Figure  9.13  phase  detector  may  provide 

*As  opposed  to  a  standard  phase  detector,  ideal  phase  detector  output  voltage 
must  not  depend  on  input  voltage  envelope  <Vo. 
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results  essentially  identical  to  the  optimum  phase  detector.  Here,  Y  ~  wi  de-band 
amplifier;  HO  —  ideal  bilateral  limiter.  Amplifier  bandwidth  must  be  sufficiently 
broad  2-'  HT)%  to  avoid  an  information  loss  concerning  input  signal-plus-noise 
y(t)  phase  d(t). 


Figure  9.13.  (a)  —  10  [ideal  limiter];  (b)  —  D  [decision]. 

Compering  the  Figure  9.13  circuit  with  the  Figure  5.4  completely -optimum  /14a 
detector  and  considering  that  the  correlator  may  be  built  based  on  the  Figure 
4.3  circuit,  it  is  not  difficult  to  become  convinced  that  the  given  phase  detector 
circuit,  in  essence,  differs  only  by  inclusion  of  the  ideal  limiter. 

Inclusion  of  the  limiter  complicates  the  detector,  but  imparts  to  it  the 
basic  property  inherent  in  a  phase  detector— insensitivity  to  amplification  insta¬ 
bility. 

The  results  presented  in  this  section  relate  to  a  precisely -known  signal. 

It  is  possible  in  principle  to  use  the  phase  approach  also  for  detection  of  a 
random  initial  phase  signal.  However,  mathematical  calculations  and  detector 
structure  significantly  are  complicated  here  (see  [131  and  [177],  for  example). 
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CHAPTER  TEN 


/148 


COMPLEX  BINARY  DETECTION 
10.1  General  Relationships 

Let  the  signal  have  one  of  m  values: 

«i (0-  ut(t), um(t). 

The  requirement  is  to  detect  whether  such  a  signal  is  present  at  system  input 
or  there  is  no  signal,  only  noise. 

Solution  of  this  problem  is  equivalent  to  finding  the  answer  to  the  question 
of  whether  any  of  m  non-zero  signals  u^,  Uj,  .  .  .,  um  (it  is  immaterial  exactly 
which  one)  is  present  at  system  input  or  whether  there  is  no  signal,  only  noise. 
This  task  is  called  complex  binary  detection. 

Detection  is  binary  because  only  two  types  of  answer  are  possible— "yes" 

(one  of  m  non-zero  signals  is  present)  and  "no"  (no  signal,  only  noise,  is  present) 
as  was  the  case  for  binary  detection  examined  in  the  preceding  chapter. 

However,  in  this  case,  detection  is  more  complex  since  the  signal  has  not 
one,  but  m  possible  types. 


Such  detection  is  of  interest,  for  example,  in  radar,  when  the  requirement  /149 
is  to  determine  whether  there  is  an  aircraft  in  some  region  of  space  consisting 
of  m  elementary  sectors,  not  specifying  in  exactly  which  sector  it  is  located 
(it  is  assumed  that  only  one  aircraft  may  simultaneously  be  located  in  the  entire 
region).  Here,  a  return  from  an  aircraft  located  in  the  k-th  elementary  sector 
may  be  considered  possible  signal  u^(t)  and  it  may  be  considered  that  establishment 
of  the  presence  of  one  of  possible  signals  u^,  .  .  . ,  um  is  equivalent  to  detection 
of  the  presence  of  an  aircraft  in  the  region  being  examined. 

Let  a  priori  probabilities  P(0),  P(u^),  .  .  .,  P(um)  of  the  absence  and  presence 
of  signals  u^,  u^,  .  .  .,  um,  respectively,  be  known.  Then,  it  is  possible  to 
compute  corresponding  inverse  probabilities  Py(0),  Py(u^),  .  .  .,  Py(um). 

It  follows  from  the  normality  condition  that 


m 

P(0)+  2  p <"*)  =  !; 

I 

m 

/yo)+  2  p*  («»)-'• 

km  | 


(10.1) 


Since  appearances  of  signals  u^(t),  .  .  .,  uffl(t)  are  incompatible  events, 
then  inverse  probability  Py(C)  of  appearance  of  any  of  m  possible  signals  (it 
is  immaterial  which  one)  equals  the  sum  of  the  inverse  probabilities  of  the  appearance 
of  each  of  these  signals,  i.  e., 


P,(C) 


2  P9  ("»)• 


(10.2) 


Analogously,  for  a  priori  probability  P(C)  of  this  event  we  have 


P(Q-  'ZPfrO- 


*-i 


(10.3) 


Solution  of  the  signal  detection  problem  using  the  maximum  inverse  probability 
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approach  requires  computation  and  comparison  between  themselves  of  inverse  proba¬ 
bilities  Py(C)  and  Py(0)  of  the  desired  events.  If  it  turns  out  that 

P9(C)>PV  (0).  (10.4) 

then,  the  answer  must  be  "yes”  (one  of  m  possible  signals  is  present).  In  the 
opposite  case,  the  answer  is  "no"  (no  signal,  only  noise).  A  receiver  operating 
on  this  principle,  as  already  noted  in  Chapter  4,  provides  minimum  composite  error 
decision  probability.  Here,  no  consideration  is  given  to  differences  in  the  sig¬ 
nificance  (danger)  of  various  types  of  errors,  i.  e.,  of  signal  false  alarms  and 
misses.  If  a  different  "weight”  is  to  be  attached  to  false  alarms  and  misses, 
then  this  may  be  considered  by  introduction  of  appropriate  weight  factor  *1- 
into  condition  (10.4).  Therefore,  in  the  more  general  case,  the  decision  /150 
must  be  "yes"  when  this  condition  is  met 

Py  (Q  >  r\Plt  (0)  (10.5) 

while  the  decision  is  "no"  in  the  opposite  case. 

Considering  (10.2),  this  condition  takes  the  form 

m 

Jmi'  Py  (Uh)  >  (/#»  ( 10 . 6) 

where 

Vo^nPyfO). 


Consequently,  the  optimum  receiving  device  must  compute  inverse  probabilities 

Py^),  .  .  .,  Py(um)  oil  possible  signals,  sum  them,  and  compare  them  with 

some  threshold  U  . 

o 

Therefore,  the  functional  diagram  of  such  a  device  in  the  general  case  must 

take  the  form  depicted  in  Figure  10.1.  In  this  figure,  /7„  /7, .  /7„, —  receivers 

computing  inverse  probabilities  Py(u^),  Py(u2^»  •  •  •»  Py(um)  of  *'he  corresP°ndin9 
simals. 


In  simple  binary  detection  of  some  signal  u  ,  described  in  the  preceding 
chapter,  the  job  of  the  optimum  receiver  boils  down  to  computation  of  inverse 
probability  P  (u  )  of  this  signal  and  comparing  it  with  some  threshold.  Therefore, 

y  c 

each  receiver  nk  in  the  Figure  10.1  system  carries  out  functions  identical  (except 
for  threshold  comparison)  as  was  the  case  for  simple  binary  detection  of  the  corres¬ 
ponding  signal  u^  and,  consequently,  has  the  same  basic  structure. 

For  our  illustration,  we  will  examine  several  specific  cases  of  complex  binary 
detection,  assuming,  as  was  the  case  previously,  that  noise  is  additive  and  white 
and  has  a  normal  distribution. 

10.2  Detection  of  Precisely-Known  Signals 

Let  all  possible  signals  u^(t),  .  .  .,  um(t)  be  precisely  known.  In  this 

case,  the  task  of  each  receiver  n *  boils  down  to  computation  of  inverse  probability 

P  (u.  )  of  a  precisely-known  signal, 
y  k 

It  was  found  in  Chapter  5  that  /151 

Py  («»)  =  4,  P(uk)e~°h/X*  c\  (5.4) 

where 

5*  -  -J- 1 Vi*) 

It  was  also  demonstrated  at  that  time  that  magnitude  P  (u.  )  may  be  computed 

y  k 

by  a  receiver  built  according  to  the  Figure  5.3  structural  scheme.  The  basic 
element  of  such  a  receiver  is  correlator  KOP,  which,  given  several  assumptions 
pointed  out  in  Chapter  4  that  usually  are  the  case,  may  be  replaced  also  by  an 
optimum  linear  filter. 

Consequently,  in  the  case  of  precisely -known  signals,  each  receiver  in 
the  Figure  10.1  schematic  may  be  built  according  to  the  Figure  5.3  structure. 


10.3  Detection  of  Random-Phase  Signals 


Let  all  signals  u^(t),  .  .  .,  uffi(t)  be  precisely  known ,  with  the  exception 
of  rf  occupation  phase,  which  is  random  and  equally  probable.  In  this  case,  the 
task  of  each  receiver  //*  boils  down  to  computation  of  inverse  probability  P  (u. ) 

y  K 

of  sicpial  with  a  random  equally -probable  phase. 

The  appropriate  expression  for  P  Cu. )  was  obtained  in  §  9.1  [see  formula 

y 

(9.15)]  and  may  be  written  in  the  form 


i.  (*$-)•  (io.7) 

2 

where  Qk  =  ak  r  k/2  —  signal  uk(t)  energy  with  amplitude  ak  and  duration  t  k< 

Magnitude  Mk  is  determined  in  accordance  with  formulas  (9.12)  and  (9.13) 
in  the  following  manner: 


Al*  =*  \r Xl  +  Y\, 
where  »*+«» 

Xh  =  |  y(t)  cos 

•k 

•k+xh 

Yk  =•>  l  y  (t)  sin  a*  t  dt. 

•k 


(10.8) 


Here,  tk  and  tk  +  t  k  —  moment  of  signal  uk(t)  onset  and  disappearance,  assumed 
to  be  known  at  the  point  of  reception. 

Formulae  (10.7)  and  (10.8)  are  written  for  the  general  case  where  signals  /152 
u^(t),  Uj(t),  .  .  .,  um(t)  differ  from  one  another  with  respect  to  amplitude  ak, 
carrier  frequency  'u  k,  duration  *k,  and  moment  of  onset  tk>  If  part  of  these 
parameters  for  all  signals  are  identical,  this  simplifies  formulas  and  optimum 
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system  design  accordingly.  In  the  general  case,  the  structural  scheme  has  the 
form  depicted  in  Figure  10.1,  while  each  receiver  fl *  accomplishes  the  operations 
required  for  computation  of  inverse  probability  P  (u.  )  using  formula  (10.7). 

y  " 

As  pointed  out  in  §  9.1,  magnitude  may  be  computed  with  the  aid  of  electronic 
circuits  depicted  in  Figure  9.1  or,  given  several  limitations  usually  satisfied, 


yfti 


UU  it*  I 


Figure  10.2.  (a)  —  OF^  [optimum  filter  k];  (b)  —  D  [detector]; 

(c)  —  RK  [difference  stage];  (d)  —  EK  [exponential  stage]. 

with  the  aid  of  an  optimum  linear  filter.  In  the  latter  case,  receiver  flh  may 
be  built  in  accordance  with  the  structural  scheme  depicted  in  Figure  10.2.  Here, 
00*  —  optimum  linear  filter  matched  with  signal  u^t),  while  A  ~  envelope 
detector  separating  the  optimum  filter  output  voltage  envelope  and  clipping  the 
value  of  this  envelope  at  the  moment  signal  uk(t)  ceases,  i.  e.,  at  moment 
tk  +  Tk*  Here,  the  response  curve  must  be  in  the  form  In  IQ(x),  rather  than 
linear.  As  noted  in  §  9.1,  such  a  curve  may  be  obtained  from  a  standard  thermionic 
diode  detector. 

— In  P(a*)  +  is  computed  in  difference  stage  PK  from  voltage  clipped  by 

detector  A  and  eqtmlling  In  .  The  result  equals  In  P  (u.)  [since  factor 

kj  in  formula  (10.7)  may  be  assumed  to  equal  unity  without  harming  the  results]. 

Therefore,  stage  3K  with  an  exponential  characteristic  converting  In  P  (u.  )  into 

y  k 

P  (u. ) ,  is  connected  beyond  the  difference  stage, 
y  k 

Since  voltage  corresponding  to  P  (u^  is  obtained  at  moment  +  Tk,  which 
can  differ  for  different  signals  irCt),  then  summing  voltages  of  the  type 
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Uk  =  Py(uk),  accomplished  in  stage  £  (Figure  10.1),  in  the  general  case  also 
must  include  within  itself  the  corresponding  time  delays  of  the  individual  summands. 

10.4  Detection  of  Fluctuating  Signals  /153 


Let  all  signals  u^(t),  . 
in  §  9.3. 


.  . ,  um(t)  be  fluctuating  signals  of  the  type  examined 


In  this  case,  every  receiver  f7k  must  compute  inverse  probability  Pu(u.), 

y  k 

which,  in  accordance  with  formulas  (9.63)  and  (.63),  may  be  written  in  the  form 


Py  («*)  =*» 


(10.9) 


where 


2 ul  -y  al 

C",="*T,"V 


(10.10) 


Magnitude  Mk  is  determined  exactly  as  it  was  in  the  preceding  case.  Factor 
\*2  during  circuit  design  may  be  assumed  to  equal  unity.  Therefore,  the  receiver 


(Al  A*  M* 


Figure  10.3.  (a)  —  0Fk  [optimum  filter  k];  (b)  —  D  [detector]; 

(c)  <**»  NK  [normalization  stage]. 

n »  structural  schematic  in  the  examined  case  may  be  represented  in  the  form  depicted 
in  Figure  10.3.  Optimum  linear  filter  0*fik  has  the  same  structure  as  in  the  Figure 
10.2  circuit.  Linear  envelope  detector  A  ,  with  transfer  constant  equalling 
hk  at  moment  tk  +  T  k  clipe  voltage  equalling  hkMk.  The  stage  with  characteristic 


o''  forms  magnitude  e**™*'  ,  which,  following  the  corresponding  normalization 
in  stage  MK  ,  provides  desired  inverse  probability  P.,(uk). 

y  k 

10.5  Detection  of  m  Orthogonal  Signals 

Let  signals  u^t),  .  .  .,  um(t)  be  equally  probable,  differing  only  in  position 
over  time  and  thereby  not  overlapping  over  time.* 

In  this  case,  the  general  structural  schematic  of  the  optimum  detection  system 
(Figure  10.1)  radically  is  simplified  since,  instead  of  m  receivers  of  the  Hk 
type,  a  total  of  only  one  such  receiver  suffices. 

Actually,  since  signals  u^t)  are  equally  probable  and  differ  only  with  /154 
respect  to  possible  time  of  arrival,  it  follows  that  inverse  probability  Py(uk) 
of  each  signal  may  be  computed  with  the  aid  of  the  identical  circuit  elements 


Figure  10.4.  (a)  —  PA  [receiver  1]}  (b)  —  Timer;  (c)  —  Delay  circuit. 

without  mode  restructuring.  Since  possible  signal  u^(t),  .  .  .,  um(t)  values 
do  not  overlap  in  time  and  receiver  fl n  must  be  connected  only  during  time  k 
when  the  anticipated  signal  is  active,  then  the  same  receiver  flk  may  be  used 
sequentially  (over  time)  to  compute  the  probabilities  Py^),  .  .  .»  Py(um)  of 
all  possible  signal  values.  Here,  the  Figure  10.1  circuit  takes  the  form  depicted 
in  Figure  1Q.4|  while  Figure  10.5  depicts  the  time  diagram  characterizing  this 
circuit's  action. 

*If  signals  u^»  •  •  •»  um  havB  parasitic  random  parameters,  then  it  is  assumed 
that  a  priori  laws  of  distribution  of  those  parameters  are  identical  for  all  signals. 


Figure  10.5 


Possible  signals  u^(t),  .  .  . ,  u^(t)  have  identical  duration  I  and  may  arise 

only  at  the  corresponding  moments  in  time  t^,  .  .  . ,  tffl  known  at  the  point  of 

reception  (Figure  10.5).  At  each  moment  in  time,  only  one  of  these  signals  may 

exist  or  signals  may  be  absent  altogether,  i.  e.,  signal-plus-noise  y(t)  may  contain 

only  noise.  The  requirement  is  to  determine  whether  one  of  these  signals  exists 

at  system  input  or  only  noise  exists.  The  timer  (Figure  10.4)  cuts  in  receiver 

/7,  sequentially  for  the  intervals  when  anticipated  signals  are  active  to  solve 

this  problem  (i.  e.,  cuts  it  in  at  the  intervals  denoted  by  the  heavy  lines  in 

Figure  10.5)  and,  by  the  beginning  of  each  successive  interval,  insures  disappearance 

of  transient  processes  caused  by  oscillations  arising  in  the  receiver  during  the 

preceding  interval.  Then,  receiver  fk  output  voltage  at  moment  t^  +  T  turns  /155 

out  to  equal  Py(u1),  at  moment  t2  +  t  ,  it  equals  Py(u2),  and  so  forth.  Finally, 

at  moment  t  +  t  ,  this  voltage  turns  out  to  equal  P  (u  ). 

m  y  m 

Therefore,  for  formation  of  the  sum  of  the  inverse  probabilities  included 
in  inequality  (10.6),  voltages  corresponding  to  values  .  .  .,  f\,(um) , 

prior  to  summing  in  stage  £  ,  are  passed  beforehand  across  delay  circuits,  which 

cause  delays  at  intervals  Atj^,  At^,  and  so  forth,  respectively. 

Computation  of  detection  error  probabilities  P. *  T  and  requires  finding 
the  probability  of  satisfying  inequality  (10.6)  when  there  is  no  signal  and  when 
there  is  a  signal  present,  respectively.  The  left  side  of  this  inequality  has 
a  law  of  distribution  differing  from  the  norm  and  precise  determination  of  this 
law  in  the  general  case  presents  great  difficulties.  Therefore,  to  date  this 
problem  only  has  been  solved  approximately  and  for  the  simplest  cases. 

The  following  cases  were  examined  by  Peterson  and  others  El7]: 


1.  Signals  u^(t),  .  .  .,  u@(t)  are  precisely  known,  equally  probable,  orth¬ 
ogonal,  and  have  identical  energies  Q*. 

In  this  case,  energy  Q',  required  to  provide  given  error  probabilities  T 
and  P„p  are  determined  from  expression 


2 


(10.1 


where  Q  —  energy  of  a  precisely -known  signal  required  in  simple  binary  detection 
(i.  e.,  where  m  s  1)  to  provide  identical  error  probabilities  Pa  t  and  Pnp  . 

Where  Pnp  <  0,1  and  P«  T<0,1  ,  magnitude  Q/Nq,  in  accordance  with  formulas 
(5.31)  and  (5.31a)  equals 


*  l V^,n  - 1  -4  +  / ,n  i - 1  >4  )*• 


(10.1 


2.  Signals  u^(t)  have  the  form 


“a  (0  -  (/)  cos  («/  4  t). 

where  envelopes  a^t)  and  frequency  U  are  precisely  known,  while  phase  is 
random  and  equally  probable  within  the  range  from  0  to  2  7T  • 

Signals  u^(t)  are  equally  probable  and  have  identical  energies  Q',  while 
envelopes  ak(t)  are  orthogonal,  i.  e., 


0  where  !  +-  k. 


f -  j  JQ.  than 


Here,  requisite  energy  Q*  magnitude  is  determined  from  the  equation 


(10.1 


(10.1 


where  Qq  —  signal  energy  required  in  simple  binary  detection  (i.  e.,  where 
m  =  10 )  and  for  identical  error  probabilities  P.\  t  and  /’ .  Relationships  for 
determining  energy  Qq  were  given  in  §  9.2.  Where  P:,  T<0,l  and  P» p<0,l  ,  /156 
magnitude  is  determined  from  formula  (9.49). 

Relationships  (10.11)  and  (10.14) ,  obtained  in  [17],  are  approximate  since, 
during  their  derivation,  the  true  law  of  distribution  of  the  log  of  the  left  side 
of  inequality  (10.6)  was  replaced  by  a  normalized  law,  which  is  approximately 
valid  only  where  m  ^1. 

For  slight  error  probabilities  (Par  <  0.1  and  P np  <  0.1),  formulas  (10.11) 
and  (10.14)  may  be  simplified  significantly. 

Initially,  we  will  examine  formula  (10.11),  i.  e.,  the  case  of  a  precisely-known 
signal.  Where  m  ^  1,  Pa  t<0,I  ,  and  />np<0,l  ,  the  result  is  ^  \ 

and  from  (10.11)  we  have 


Sr  T,nm  + 


Q 

V 


Considering  relationship  (10.12),  finally  we  obtain 


(10.15) 


TTt={V  ]nT^~X'A+V  tn7i--l,7)%0.5lnm.  (10.16) 

Now  we  will  examine  formula  (10.14)  for  a  random-phase  signal. 

When  Pa  t<0,I  and  Pnps£0,l  ,  the  result  is  2Q0/N0^12j  here,  as  follows 
from  (10.14),  it  must  be  that 


l  H - Ig 

m 


>10*. 


therefore,  in  formula  (10.14),  it  is  possible  to  assume  with  great  precision; 


and 


Considering  these  relationships!  formula  (101.4)  is  reduced  to  the  form 

£-*St+o,5lnm.  (10.17) 

Since  magnitude  Q£J/Nq  is  determined  by  fortiula  (9.49),  then  it  is  possible 
to  write 

+  (10.18) 

As  f 0II0W8  from  what  has  been  stated,  formulas  (10.15)  and  (10.17)  [or  (10.16) 
and  (10.18)]  were  obtained  with  the  assumption  that  m^>l.  However,  it  follows 
from  expressions  (10.15)  and  (10.17)  that  they  provide  the  correct  result  also 
where  m  a  1  (Q*  a  Q  and  Q*  =  Qq,  respectively). 

Comparison  of  formulas  (10.16)  and  (10.18),  valid  for  precisely-known  /157 
and  for  random-phase  signals,  respectively,  demonstrates  that,  where  t<0,I 
and  /’up  <0.1  ,  phase  irregularity  will  lead  to  an  insignificant  (less  than 
2  db)  increase  in  requisite  signal  energy.  The  less  P *  T  and  P„ p  and  the  greater 
m,  the  less  the  impact  of  signal  phase  irregularity. 
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CHAPTER  ELEVEN 


/157 


DETECTION  AND  RECOGNITION  OF  SIGNALS  WITH  MANY  POSSIBLE  VALUES 
11.1  Optimum  Receiver  Structure 

In  Chapter  10,  a  case  was  examined  in  which  the  requirement  is  to  detect 
whether  or  not  any  of  m  possible  non-zero  signals  Uj(t),  .  .  um(t)  is  present 
at  input,  without  refinement  of  exactly  which  signal  is  present.  In  this  chapter, 
we  will  examine  a  case  where  the  requirement  is  not  only  to  determine  whether 
or  not  any  of  m  non-zero  signals  is  present  (i.  e.,  to  solve  the  detection  problem), 
but  also  to  indicate  exactly  which  of  the  possible  signals  is  present  (i.  e., 
to  solve  the  signal  recognition*  problem).  Therefore,  such  a  case  may  be  referred 
to  as  signal  detection  with  recognition.  This  occurs,  for  instance,  during  aircraft 
detection  by  a  radar  with  a  plan-position  indicator.  Actually,  such  a  radar  must 
determine  whether  or  not  there  is  an  aircraft  (or  any  other  desired  object)  within 
an  investigated  region  of  space  comprising  m  elementary  sectors  and,  if  there 
is,  then  in  exactly  which  of  these  sectors  it  is  located  (it  is  assumed  that  only 
one  aircraft  may  be  located  simultaneously  in  the  entire  region).  Here,  the  return 
from  an  aircraft  located  in  the  k-th  elementary  sector  may  be  considered  possible 


*Lately,  this  problem  often  is  referred  to  as  signal  classification;  the  term 
recognition  is  used  only  for  discrimination  of  complex  models. 


I 


signal  u^t)  and  the  radar  task  is  equivalent  to  the  multivalue  signal  detection 
and  recognition  task  formulated  above. 


As  indicated  in  Chapter  5,  solution  of  such  a  problem  with  minimum  composite 
error  probability  requires  comparison  between  themselves  of  inverse  probabilities 
P  (u  ),  •  •  •»  P  (u  )  possible  signals  and  selection  of  that  signal 

inverse  probability  turns  out  to  be  greatest. 

Here,  a  case  of  no  signal  (i.  e. ,  only  noise  at  input)  is  considered  /158 
presence  of  signal  uQ  identical  with  zero,  i.  e. 


«,(/)«-<). 


(11.1) 


All  possible  errors  are  considered  equally  dangerous  in  this  solution  to 
the  problem.  If  it  is  desirable  to  attach  different  weights  to  different  errors, 
then  it  is  necessary  to  compare  inverse  probabilities  taken  with  corresponding 
weights  7|o,  7|  .  .  .,  n,  i.  e.,  to  select  the  greatest  of  the  following 

magnitudes: 


n,  P,  («»).  *  Hi  P,  («i) . %  Pu  (uj. 


However,  for  simplicity,  in  future  all  weights  are  assumed  to  be  identical. 


Pf’tyU,/ 


VfPyfrk)  /A 


Figure  11.1.  (a)  —  P  [receiver];  (b)  —  SV  [sampling  circuit]. 

Than,  the  optimum  receiver  structural  diagram  may  be  represented  in  the  form  depicted 
in  Figure  11.1. 


In  this  figure,  inverse  probabilities  P  (u^), 


by  receivers  Hit 


nn 


■ ,  Py Cum>  are  computed 


,  while  inverse  probability  Py(uQ)  is  considered  some  constant 


bias  Uq.  Sampling  circuit  CB  selects  from  voltages  U^, 


•»  um»  equalling 


inverse  probabilities  Py(u^), 


, ,  Py(um)»  the  greatest  voltage,  designated 


«.  MX 

out  that 


Further,  this  voltage  is  compared  with  threshold  bias  UQ  and,  if  it  turns 


u»ux^U0,  i.  e. ,  ^bux  ^  0, 


(11.2) 


then  a  decision  is  made  that  no  signals  u^,  .  .  .,  um  are  present. 


In  the  opposite  case,  the  response  is  that  one  of  m  non-zero  signals  is  present 
at  system  input,  this  being  that  signal  to  which  greatest  voltage  value 
corresponds. 


Since  the  job  of  receiver  77*  is  to  compute  inverse  probability  P  (u.  ), 

y  k 


i.  e.,  solution  of  the  same  problem  as  was  the  case  in  the  detector  circuit  (Figure 
10.1),  then  their  structure  and  operating  mode  may  be  selected  so  that  receiver  77* 
may  be  built  in  accordance  with  the  Figure  10.2  circuit,  or  from  the  Figure  10.3 
circuit  in  the  case  of  a  fluctuating  signal.  However,  the  receiver  77*  structure 
in  Figure  11.1  may  be  simplifed  since,  here,  any  simultaneous  monotonic  /159 


nonlinear  conversion  of  voltages  U 


•  ui« 


Um  is  permissible. 


Actually,  in  the  Figure  11.1  circuit,  any  decision  made  is  because  each  of 


the  magnitudes  Uq, 


U 


V 


Um  turns  out  to  be  the  greatest.  Therefore,  if 


all  these  magnitudes  are  subjected  to  the  identical  inertia-free  nonlinear  trans¬ 
formation  of  the  type 


where  f(Uk)  --  any  monotonic  function,  and  transformed  magnitudes  U^'  are  compared 
among  themselves,  then  the  greatest  magnitude  wi11  be  transformed  into  the 
greatest  magnitude  U^'  and,  consequently,  the  result  of  the  comparison  remains 
unchanged.  Thus,  for  example,  it  is  evident  from  Figure  11.2  that,  if  magnitude 
IL  turns  out  to  be  the  greatest,  then  voltage  IL'  also  will  be  the  greatest. 


Figure  11.2 


Consequently!  it  is  permissible  in  the  optimum  detection  with  recognition 

system  (Figure  11.1)  to  use  any  identical  monotonic  transformation  of  all  output 

voltages  U, ,  .  .  . ,  U  ,  if  threshold  bias  U  here  also  is  subjected  to  the  same 
1  Ur  O 

transformation,  i.  e.,  to  adjust  the  magnitude  of  this  bias  accordingly.  Such 
system  invariance  to  random  monotonic  output  voltage  transformations  makes  it 
possible  in  several  cases  to  simplify  receiver  /7*  output  stages  considerably. 


For  example,  we  will  examine  the  case  of  fluctuating  signals.  Here,  in  accor¬ 
dance  with  (10.9),  we  have 


P,  («»)=*, 


/>(«*) 
,+  Qrp» 

N, 


(10.9) 


We  will  assume  that  all  signals  are  equally  probable  and  have  identical  energies 
Q.  Then,  it  is  possible  to  represent  expression  (10.9)  in  the  form 


Uk  —  Py  («*)  =Ce 


.i 

A,  Mk 


(11.3) 


where  C  —  constant  independent  of  the  k-th  signal.  Here,  the  receiver  //* 
circuit  has  the  form  depicted  in  Figure  10.3,  but  the  normalizing  stage  is  absent. 
Consequently,  receiver  /7,  may  comprise  optimum  filter  0</>* ,  linear  amplitude 
detector  JJ  ,  and  a  stage  with  a  characteristic  of  the  type  e*’ . 

In  accordance  with  what  was  stated  above,  it  is  possible  to  uee  any  monotonic 


transformation  to  magnitude  P  (u.)( identical  for  numbers  k).  In  this  case,  receiver 

y  K 

structure  simplification  requires  use  of  a  transformation  of  the  type 

u.  = 

This  transformation  is  monotonic  with  respect  to  and,  consequently,  is 
permissible. 

From  (11.3)  and  (11.4),  we  obtain 

U't  =h„Mk.  (11.5) 

Consequently,  instead  of  comparison  among  themselves  and  with  threshold  Uq 
of  type  (11.3)  expressions  U^,  it  is  possible  to  compare  much  more  simply  expression 

Figure  11.3  Figure  11.4 

KEY:  (a)  —  OF^  [optimum  filter  k]j  (b)  —  D  [detector], 

(11.5),  having  changed  the  threshold  here  from  UQ  to  IM.  This  denotes  that  it 
is  possible  to  remove  the  characteristic  of  the  type  e*'  from  the  Figure  10.3 
circuit  and  it  will  ‘■.ake  on  the  form  depicted  in  Figure  11.3.  In  this  circuit, 

U  i3  a  linear  amplitude  detector. 

Now,  we  will  examine  a  case  where  signals  u^Ct)  have  known  amplitudes  and 
random  equally-probable  phases.  Here,  receiver  /7*  structure  will  have  the  form 
depicted  in  Figure  10.2.  Again,  we  will  assume  that  all  signals  are  equally 
probable  and  have  identical  energies.  Then,  difference  stage  PK  in  the  Figure 
10.2  circuit  may  be  removed  and  receiver  /7*  will  comprise  optimum  filter  OQh  and 

an  envelope  detector  with  response  curve  of  the  type  Io(— Since  this  curve 


(11.4) 
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is  mo notonic  with  respect  to  msgnitude  changes,  then  it  is  possible  to  compare 

voltages  proportional  to  a^,  instead  of  voltages  proportional  to  , 

due  to  system  invariance  to  monotonic  transformations.  This  denotes  that  receiver 
in  the  Figure  11.1  system  may  be  built  in  accordance  with  the  Figure  11.4  layout. 

In  this  circuit,  R  is  a  linear  amplitude  detector.  Here,  threshold  bias  in  the 
Figure  11.1  circuit  must  be  changed  accordingly  (from  UQ  to  UQ'). 

Comparing  the  Figure  11.3  and  11.4  circuits,  obtained  for  a  fluctuating  signal 
and  for  one  with  known  amplitude,  respectively,  it  is  easy  to  become  convinced 
that'they  differ  only  in  factors  h^  and  a^.  If  signals  u^Ct)  have  not  only  identical 
energies  (as  was  accepted  above),  but  identical  duration  as  well,  then  factors 
h^  and  a^  will  not  depend  on  number  k  and,  therefore,  may  be  omitted  (given  the 
corresponding  adjustment  of  output  threshold  bias  Uq).  Here,  the  Figure  11.3 
and  11.4  circuits  coincide  completely.  This  denotes  that,  given  the  aforementioned 
assumptions,  receivers  flu  in  the  optimum  detection  with  recognition  system  (Figure 

11.1)  tum  out  to  be  identical  for  fluctuating  signals  and  for  those  with  known 
amplitude.  The  only  difference  is  threshold  bias  Uq  magnitudes.  Consequently, 
the  transition  in  the  Figure  11.1  circuit  from  optimum  reception  of  signals  with 
known  amplitude  to  optimum  reception  of  fluctuating  signals  may  be  made  /161 
without  any  changes  in  the  receiving  system  circuit,  with  the  exception  of  the 
corresponding  adj  ustment  of  output  bias  UQ. 

Thus,  with  the  assumptions  made  above,  an  optimum  detection  with  recognition 
system  has  invariance,  not  only  with  respect  to  monotonic  transformation  of  the 
system's  response  curve  shape,  but  also  to  a  change  in  the  law  of  signal  amplitude 
distribution  as  well.  This  property  makes  it  possible  to  make  the  system  simpler 
and  more  standard  than  is  the  case  for  optimum  detection  without  recognition  (Figure 

10.1) . 

11.2  Detection  and  Recognition  Error  Probabilities 

The  following  types  of  errors  are  possible  in  a  signal  detection  with  recognition 
system  (Figure  11.1): 

1.  False  alarms,  i.  e.,  responses  concerning  presence  of  some  signal  when, 


in  actuality,  there  is  no  signal  at  input.  P„  T  denotes  the  probability  of  this 
error. 


2.  Distortions,  i.  e.,  incorrect  responses  in  cases  when  there  is  some  signal 
at  input  [signal  misses  or  incorrect  indication  of  the  number  k  of  present  signal 
u^tt)].  PaeK  designates  the  probability  of  such  a  distortion. 


3.  It  follows  from  point  2  that  a  signal  miss  (i.  e. ,  a  response  concerning 
signal  absence  at  input  when,  in  actuality,  any  of  m  non-zero  signals  is  present) 
is  a  particular  case  of  distortion.  Therefore,  miss  probability  ^op  always  is  less 
than  or  equal  to  distortion  probability: 

P  <p  (11.6) 

Composite  error  probability  P0 m  and  composite  correct  response  probability 
pat>»»  corresponding  to  it  may  be  computed  from  formulas  (5.7)  and  (5.8).  Along 
with  composite  probability  f’npaa,  it  often  is  convenient  also  to  include  conditional 
correct  response  probability  PKn p««.  determined  for  a  condition  where  any  one  of 
the  non-zero  signals  is  present  at  input. 

Evidently, 


Pe  npia  “  l  ^neir  (11.7) 

i 

In  the  general  case,  error  probability  computation  represents  considerable 

* 

difficulties.  Therefore,  we  will  limit  ourselves  to  examination  of  a  rather- 
particular,  but  important,  case  meeting  the  following  conditions: 

^  a)  noise  is  additive  normal  white  noise; 

b)  all  non-zero  signals  u^(t),  .  .  .,  um(t)  are  equally  probable,  orthogonal, 
and  have  identical  energies.* 

1  This  case  was  examined  in  §  5.4  for  precisely-known  signals  and  in  the  /162 


*If  signals  u^(t),  .  .  .,  uffl(t)  have  parasitic  random  parameters,  then  the 
laws  of  distribution  of  these  parameters  for  all  signals  are  assumed  to  be  identical. 
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assumption  that  the  a  priori  zero  signal  probability  equals  zero.  In  this  section 
we  examine  a  more-general  case  where  signals  may  have  random  parameters,  while 
the  a  priori  zero  signal  probability  does  not  equal  zero. 

As  already  noted  in  Chapter  5,  signal  orthogonality  will  lead  to  the  fact 
that  their  distortion  by  noise  turns  out  to  be  statistically  independent.  In 


Figure  11.5.  (a)  —  Channel  number,  receiver  number; 

(b)  — — SV  [sampling  circuit]. 

addition,  it  also  was  shown  that  relationships  obtained  for  such  signals  turn 
out  to  be  valid  also  for  the  Figure  5.6  m-channel  receiving  device.  In  this  case, 
considering  that  the  signal  also  may  have  a  zero  value,  the  m-channel  receiving 
device  schematic  takes  the  form  depicted  in  Figure  11.5. 

As  before,  we  assume  that  additive  noise  voltages  «„,(/),  ....  uwm(0  active 
at  channel  input  have  normal  laws  of  distribution  with  zero  mean  values  and  identical 
dispersions  N  (here,  N  *  ).  Signal  uc(t)  (precisely  known  or  with  parasitic 

random  parameters)  with  equal  probability  may  be  present  at  input  of  any  of  m 

channels  or  be  absent  in  all  channels.  The  task  of  the  receiving  device  in  this 

case  comprises  determining  whether  signal  uc(t)  is  present  at  input  of  any  channel 
and,  if  so,  at  input  of  exactly  which  channel. 

All  Figure  11.5  designations  correspond  to  the  Figure  11.1  designations. 

Optimum  receivers  77, . . i/„  in  the  Figure  11.5  schematic  have  a  structure  identical 

to  that  of  the  Figure  11.1  system,  if  the  latter  is  designed  for  receipt  of  signals 


not  overlapping  over  time,  i.  e.f  if  signal  orthogonality  is  achieved  because 
they  do  not  overlap  over  time. 

Probabilities  P„T  and  Pnv  have  the  exact  concept  in  the  case  of  an  m-channel 
system,  as  is  the  case  of  m  possible  signals. 

I 

The  term  distortion  probability  Pncn  in  an  m-channel  system  is  understood 
to  mean  the  probability  of  an  incorrect  response  when  signal  uc(t)  is  present, 
i.  e.,  a  signal  miss  or  incorrect  indication  of  the  number  of  the  channel  in  which 
a  signal  is  present.  Here,  formulas  for  determination  of  error  probabilities  P»x< 
P„v  and  P„e„  in  both  cases  (i.  e.,  in  the  case  of  m  orthogonal  signals  and  in 
an  m-channel  system)  are  obtained  in  the  identical  way.  Therefore,  in  future  /163 


Figure  11.6.  (a)  —  P  [receiver];  (b)  --  LS  [arithmetical  unit]. 

for  precision  we  will  make  our  examination  relative  to  a  case  of  m  orthogonal 
signals,  i.  e.,  to  the  Figure  11.1  system.  Here,  it  becomes  convenient  for  analysis 
to  replace  the  optimum  detection  with  recognition  system  (Figure  11.1)  with  the 
system  depicted  in  Figure  11.6.  This  system  in  future  is  called  quasi-optimum 
since  probabilities  p«  *  and  pnp  in  it  are  obtained  in  exactly  the  same  manner 
as  in  the  optimum  system,  but  distortion  probability  p*c*  is  somewhat  greater. 

Receivers  flt . Ilm  and  threshold  bias  Uq  in  a  quasi-optimum  system  are  identical 

to  those  in  the  optimum  system,  but  bias  UQ  is  connected  at  the  output  of  each 
system  receiver  lh  rather  than  at  system  output. 

The  threshold  bias  overruns  of  all  channels  are  added  in  the  J1C  section, 


A U„=*Uh—U9  where  Uh>U9\  , 

At/k »0  where  Uk^U9. 

i 

Evidentlyt  uaH1  >0  is  always  the  case. 

If  uaHt  s  0)  then  it  is  considered  that  there  is  no  signal  at  system  input; 
if  <*at|]|  >0,  then  the  decision  is  that  there  is  a  signal  at  input  and  it  is  that 
very  signal  u^(t)  for  which  it  turned  out  that  >  Uq*  (here,  the  arrow  of  the 
appropriate  indicator  1^  dips). 

We  will  demonstrate  that,  in  this  principle  of  quasi-optimum  system  operation, 
false-alarm  and  miss  probabilities  arising  in  it  will  be  identical  to  those  in 
an  optimum  system,  while  distortion  probability  is  somewhat  greater. 

In  the  optimum  system  (Figure  11.1),  a  false  alarm  will  occur  if  u»m*>  /164 

is  the  case  when  there  is  no  signal  and  will  not  occur  if  <w  <  U*  ,  where 
u'n mi  is  the  greatest  of  voltages  U^,  .  .  . ,  Um< 

But,  if  the  greatest  of  voltages  U^,  .  .  . ,  Um  exceeds  UQ,  then  there  will 
be  a  false  alarm  in  the  quasi-optimum  system  (Figure  11.6)  as  well;  if  the  greatest 
of  voltages  U.,  .  .  .,  U  does  not  exceed  U  ,  then  there  will  be  no  false  alarm 
in  the  quasi-optimum  system  as  well  (Figure  11.6). 

Consequently,  esses  when  there  are  and  are  not  false  alarms  in  the  systems 
examined  (Figures  11.1  and  11.6)  always  coincide,  which  means  that  false-alarm 
probabilities  P*  »  also  coincide. 

♦If  threshold  UQ  is  exceeded  in  more  than  one  channel  when  there  is  a  signal, 

than  there  is  no  signal  number  indication  (i.  e.,  recognition)  and  it  is  assumed 
that  a  distortion  is  occurring. 


Analogously,  one  may  show  that  miss  probabilities  in  both  systems  (Figures 
11.1  and  11.6)  also  coincide. 

We  now  will  compare  distortion  probabilities  P„c„. 

It  follows  from  (11.7)  that,  instead  of  distortion  probabilities  PMh  ,  it 
is  possible  to  compare  among  themselves  conditional  probabilities  Pc np*s,  i.  e., 
probabilities  of  correct  response  for  the  condition  where  any  one  of  m  signals 
is  present  at  input. 

In  the  Figure  11.1  system,  when  signal  number  k  is  present,  a  correct  response 

is  dependent  on  voltage  Uk  at  appropriate  receiver  flk  output  being  greater  than 

at  the  output  of  the  remaining  receivers  and,  in  addition,  it  must  exceed  threshold 

U  . 
o 

In  the  quasi-optimum  system  (Figure  11.6),  when  signal  number  k  is  present, 
a  correct  response  depends  not  only  on  receiver  output  voltage  being  greatest 
and  exceeding  threshold  Uq,  but  also  on  the  output  voltages  of  all  remaining  receivers 
being  less  than  UQ. 

Consequently,  the  conditions  for  obtaining  a  correct  response  are  stricter 
in  the  quasi-optimum  system  than  in  the  optimum  system  and  they  will  be  met  in 
fewer  cases.  Therefore,  conditional  correct  response  probability  Pe  npa. is  less 
in  a  quasi-optimum  system  than  in  the  optimum  system. 

Thus,  if  fal8e-alarm,  miss,  and  distortion  probabilities  in  the  optimum  system 
equal  Pn  t,  Pa p  and  Pnnm  ,  then,  in  the  quasi-optimum  system,  the  probabilities 
of  false  alarms  and  misses  will  be  identical,  while  distortion  probability  P^  will 
be  less,  more  precisely 

P*c,>Pwt»-  (11.9) 

We  will  compute  error  probabilities  in  the  quasi-optimum  system  (Figure  11.6). 
This  system  will  comprise  m  channels,  each  comprising  a  receiver  /7*  ,  threshold 
bias  U  .  and  bias  overrun  indicator  I.  , 


We  will  introduce  probabilitiee  P*  T  „  and  Pnv  k  for  each  such  channel,  having 
determined  them  in  the  following  manner: 

Pa  t  a  is  the  probability  of  bias  Uq  overrun  in  the  k-th  channel,  when  there 
is  no  signal  with  this  number  at  input; 

P„p  ,  is  the  probability  that  bias  UQ  in  the  k-th  channel  will  not  be  /165 
exceeded  when  signal  uk(t)  with  this  number  is  at  system  input. 

Initially,  we  will  show  that  probabilities  P*  T  t.  and  Pnp  h  determined  in 
this  manner,  in  the  case  of  orthogonal  signals,  coincide  with  the  false-alarm 
and  miss  probabilities,  respectively,  in  each  channel,  given  simple  binary  detection. 

If  the  k-th  channel  is  operating  in  the  simple  binary  detection  mode,  then 
the  false-alarm  probability  in  this  channel  is  the  probability  that  threshold 
UQ  will  be  exceeded  at  its  output  when  only  noise  is  present  at  input.  Probability 
Pa  T »  ,  as  accepted  above,  is  determined  for  a  condition  where  a  signal  only  of 
given  number  u^(t)  is  absent  at  the  k-th  channel  input.  This  signifies  that, 
given  determination  of  amplitude  P„  T  k  at  the  input  of  a  given  numbered  channel, 
a  signal  of  any  other  number  1  may  be  present,  where  1  i  k  (or  there  may  be  no 
signal  at  all).  However,  since  all  signals  are  orthogonal,  they  do  not  overlap 
with  respect  to  time  or  to  frequency  spectra,  for  instance,  then  a  signal  only 
of  given  number  k  may  pass  through  receiver  /7*  ,  while  signals  of  all  remaining 
numbers  exert  no  impact  whatsoever  on  this  receiver's  output  voltage  (nor  on  the 
probability  of  a  threshold  UQ  overrun  or  non-overrun).  Thus,  for  example,  if 
signals  do  not  overlap  with  respect  to  time,  then  each  receiver  /7»  opens  only 
for  a  time  interval  from  t^  to  +  ^"k»  in  which  only  anticipated  signal  uk(t) 

may  be  active.  During  action  of  any  remaining  anticipated  signals,  receiver 
//*  will  be  blanked. 

If  signal  orthogonality  is  achieved  so  that  they  do  not  overlap  with  respect 
to  frequency  spectra,  then  receivers  /7ft  have  nonoverlapping  bandwidths  in  all 
input  networks  and  the  spectra  of  all  anticipated  signals  except  1^(0  fail  to 
reach  the  bandwidth  of  each  receiver  77*  . 

Thus,  only  noise  and  signal  u^(t)  may  impact  upon  receiver  17^  output  voltage, 
and  signals  u^(t)  (where  1  i  k)  exert  no  impact  whatsoever.  Hence,  it  follows 
that  probability  P»  T  * eqtmls  the  false-alarm  probability  in  the  k-th  channel, 


given  simplex  signal  u^(t)  binary  detection.  It  will  not  be  difficult  analogously 
to  become  convinced  that  probability  P«p  equals  the  signal  miss  probability  in 
the  k-th  channel,  given  simple  signal  u^(t)  binary  detection. 

False-alarm  and  miss  probabilities  for  simple  binary  detection  were  found 
earlier  in  Chapter  9  and  simple  formulas  were  obtained  for  certain  signal  types. 
Thus,  for  example,  formula  (9.83)  is  valid  for  simple  binary  detection  of  a  fluc¬ 
tuating  signal  and,  in  this  case,  may  be  written  in  the  form 

.  i 

Qep  _  Pa  r  k  • 

X  T  i - '•  (11.10) 

where  Qep  —  signal  uk  (  )  average  energy  (since  it  was  accepted  that  all  /166 
possible  signals  have  identical  energies,  then  will  not  depend  on  number  k). 

Consequently,  if  the  permissible  error  probabilities  P*  T»  and  P„ p  x  in  each 
channel  are  known,  then  the  corresponding  simple  binary  detection  formulas  may 
be  used  to  determine  requisite  signal  energy.  Therefore,  it  remains  only  to  find 
the  link  between  the  error  probabilities  in  each  channel  and  in  the  system  as 
a  whole,  i.  e.,  the  relationship  of  probabilities  Pa  T.  Pnp  and  Puen  to  Pa  t  » 
and  P op »  .  To  do  so,  we  again  turn  our  attention  to  the  Figure  11.6  diagram 
and  we  will  consider  that,  in  this  case,  when  all  signals  are  accepted  as  being 
equally  probable  with  identical  average  energies  and  identical  laws  of  parasitic 
random  parameter  distributions,  probabilities  P„  T  k  and  P  „p  *•  in  all  channels 
are  identical,  i.  e.,  will  not  depend  on  number  k  (in  spite  of  this,  we  will  retain 
index  x  in  order  to  underscore  that  the  given  probabilities  relate  to  one  system 
channel,  and  not  to  the  system  as  a  whole). 

Initially,  we  will  find  false-alarm  probability  P«  r  or,  which  is  more  con¬ 
venient,  probability  (*  —  »)  of  the  absence  of  a  false  alarm  in  the  system. 

In  order  for  there  to  be  no  false  alarm  in  the  examined  system,  there  is 
a  requirement  that  the  threshold  not  be  exceeded  in  any  one  of  the  channels  when 
there  is  no  signal,  i.  e.,  that  there  be  no  false  alarms  in  any  channel.  But, 
the  probability  of  the  absence  of  a  false  alarm  in  a  given  channel  equals 


(1  —  P«  t  h)  ,  while  errors  in  all  channels  statistically  are  independent  in  light 
of  the  aforementioned  assumptions;  therefore 

0-P,, „)m  and  1-0 -/>„*)".  (11.11) 

Now,  we  will  find  miss  probability  P op  .  Evidently 

Ppp  =  Pe(«l)Plop+Pe(«*)  Pt„p  +  ...  +  Po(«m)Pmnp.  (11*12) 

where  PQ(u^),  •  •  •»  PQ(um)  —  a  priori  probabilities  of  the  presence  of  signals 
Uj,  .  .  . ,  uffl,  respectively,  with  the  condition  that  any  one  of  these  signals 
must  be  present,  i.  e., 


Pfl(wi)  +  --  +  Pe  (««)=*  I.  (11.13) 

The  designation  Pk  is  used  for  the  signal  miss  probability  in  the  system 
for  a  condition  where  signal  u^(t)  is  present  at  its  input  (do  not  confuse  conditional 
probability  Pk  np  ,  relating  to  the  system  as  a  whole,  with  probability  PDp  *. 
relating  to  one  of  this  system's  channels). 

In  light  of  the  assumptions  made  above  concerning  the  nature  of  the  signals, 
we  obtain 


P lop  P *np  ”  •••  —  Pm  np* 

therefore,  formula  (11.12)  provides 

PBp  =  Pup  [?>«)+  .  +  P«(«JI- 

Considering  normality  condition  (11.13),  we  obtain 


Pfip  ~  P ! 


Inp* 


(11.14) 


i.  e.,  for  the  signals  being  examined,  when  computing  miss  probability  P„v  /167 
in  the  system  as  a  whole,  one  may  assume  that  signal  u.(t)  is  present  at  system 


When  signal  u^(t)  is  present,  a  miss  occurs  in  the  system  (Figure  11.6)  if 
threshold  Uq  is  not  exceeded  in  any  one  of  the  system's  m  channels.  But,  with 
respect  to  the  determination,  the  probability  that,  given  signal  u^(t)  presence, 
the  threshold  will  not  be  exceeded  in  the  first  channel  is  P„ p ,  ,  meanwhile 
F\iP  i pn p  h  .  Given  signal  u^(t)  presence,  the  probability  that  the  threshold 
will  not  be  exceeded  in  the  k-th  channel  (where  k  i  1)  is  I  —  P„  T  * •  Therefore, 
in  light  of  the  statistical  independence  of  the  errors  of  individual  channels, 
the  probability  that,  given  signal  u^(t)  presence,  the  threshold  will  not  be  exceeded 
in  any  one  of  the  channels  equals  (I  —  P„  T  ft)™-1  ;  the  probability  that  it,  in 

addition,  will  not  be  exceeded  in  the  first  channel  equals 
Pap  »  (1  —  Pn  r  ft)™-1  5  consequently, 

Ap»ApftO -At  <n-15> 

We  now  will  find  distortion  probability  Am  .  In  this  case,  it  is  more 
convenient  first  to  find  the  probability  of  no  distortion 

Anp*»“  1— Am*  (11.16) 

where  Pe nP»  and  Pan  —  conditional  probabilities  determined  for  the  condition 
that  one  non-zero  signal  (it  is  immaterial  exactly  which  one)  is  present  at  system 
input. 


Consequently,  A  km*  is  the  probability  of  a  correct  response  for  the  condition 
that  only  one  of  the  non-zero  signals  is  present  at  system  input.  Evidently, 

Ptapau  “  A  (Ml)  Anpftt  +  A  (Ht)  P *nw»  +  •••  +  A  (Hm)  Pm  np«i*  (11.17) 

where  Pft npaa—  probability  of  a  correct  response  for  the  condition  that  signal 
is  present  at  system  input. 

In  light  of  the  aforementioned  assumptions  concerning  the  signal  nature, 
this  condition  must  be  met 


Anpa*  *“  Anp«»  “  •••  **  An p«»- 


Considering  this  relstionship  and  normality  condition  (11.13),  we  obtains 

Penpal  ^1  npn*  (11.18) 

i.  e. ,  it  is  possible  to  compute  correct  response  probability/^ ni.a.in  this  case 
for  the  condition  that  signal  u^(t)  is  present  at  system  input. 

In  order  for  a  correct  response  to  be  supplied  here,  it  is  necessary  that 
threshold  Uq  be  exceeded  in  the  system's  first  channel  (Figure  11.6),  while  no 
threshold  overrun  occurs  in  all  remaining  channels.  But,  given  presence  of  signal 
u1(t),  the  probability  of  threshold  UQ  overrun  in  the  first  channel  equals 
(1  —  P np  *)  ,  while  the  probability  of  a  threshold  non-overrun  in  all  remaining 
channels  equals  (1  —  Pa  t  therefore,  the  probability  of  coincidence  /168 

of  all  aforementioned  events  (considering  their  statistical  independence)  equals 

and,  considering  (11.16),  finally  we  obtain 

P»«  =  l— (I -Pnp*)(,-P,  (11.19) 

Formulas  (11.11),  (11.15),  and  (11.19)  establish  the  link  between  the  error 
probabilities  in  the  system  as  a  whole  (  P «  *,  PM  and  P'mt* )  and  in  each  channel 
of  this  system  (  P*  T»  and  P* p*  ).  Therefore,  based  on  these  formulas,  it 
is  possible  with  respect  to  the  given  (permissible)  error  probabilities  in  the 
system  as  a  whole  to  determine  permissible  probabilities  Pn  r  *  and  P  „„  *in  each 
channel,  i.  e.,  fa Is e-alarm  and  miss  probabilities  during  simple  binary  detection. 
Based  on  magnitudes  P»  t  »  Pgp  k  found,  using  the  simple  binary  detection  formulas 
[formula  (5.31),  for  example],  it  is  possible  to  determine  requisite  signal  energy 
Q  (  Qt p  for  a  fluctuating  signal).  Given  such  energy,  error  probabilities 
Pn  t.  Pm p  and  P'wtn  occur  in  the  quasi-optimum  system  (Figure  11.6). 

As  indicated  above,  given  such  signal  energy,  false-alarm  and  miss  probabilities 
in  the  optimum  system  (Figure  11.1  or  11.5)  will  be  exactly  the  same,  while  distortion 
probability  Pam  will  satisfy  inequality  (11.8),  i.  e., 


Consequently!  the  computational  procedure  indicated  above  makes  it  possible 
relatively  easily  to  determine  probabilities  P  and  Pnp  in  the  optimum  system 
and  to  find  the  upper  limit  for  distortion  probability  Pntn  .  Especially-simple 
and  clear  relationships  result  if  error  probabilities  P  at*  P np  and  P hck  (and,  conse¬ 
quently,  /»„„)  are  sufficiently  slight.  Therefore,  we  will  tum  to  examination 
of  this  case. 

11.3  The  Case  of  Slight  Error  Probabilities 

We  will  assume  that  the  false-alarm  probability  is  sufficiently  slight,  and 
precisely  is 

0,1;  (11.21) 

here,  from  (11.11)  with  less  than  5*  error  we  obtain: 


Considering  relationships  (11.21)  and  (11.22),  it  is  possible  with  less  /i69 
than  10%  error  to  assume 

Pup  Pnp  h-  (11.23) 

With  about  the  same  accuracy,  from  (11.19)  we  have 


P «««  +  mV 


(11.24) 


Where  m^^lO,  it  is  possible  to  assume 


^HCK~  P np-|-  P*T 


(11.24a) 


It  follows  from  formulas  (11.24)  and  (11.24a)  that,  for  given  probabilities 
P np  and  P  *  t  ,  permissible  probability  Puck  already  may  not  be  selected  randomly, 
but  must  be  determined  from  these  formulas.  If  you  assume  that  Pn  T<  0,1  and 
P np<0,l  ,  then  here,  the  following  always  will  be  the  case 


P He*  0.2- 


(11.25) 


As  indicated  above,  finding  requisite  signal  energy  means  finding  permissible 
probabilities  ^hp  *  and  T*in  a  simple  binary  detection  channel  using  givens 
/’up  and  P.i  t  f  then  substituting  them  into  the  corresponding  simple  binary  detection 
formulas. 


In  this  case,  as  follows  from  (11.22)  and  (11.23),  we  obtain 


^ap*=^(ip'  Pnrk~~  f||  *  (11.26) 

thus,  the  formula  for  signal  energy  for  detection  with  recognition  of  m  possible 
signals  may  be  obtained  from  the  corresponding  simple  binary  detection  formulas, 
if  Pn  T  is  replaced  by  P„  ,/m  in  the  latter. 

The  following  simple  binary  detection  formulas  were  obtained  in  preceding 
chapters: 

a)  for  a  precisely-known  signal, 


Q_ 


(5.31a) 


b)  for  a  random-phase  signal 


I 


c)  for  a  fluctuating  signal 


Qcp 

n9 


i 

P«  T  ‘ 


Replacing  Pn  r  with  thn  in  these  formulas,  we  obtain  the  corresponding  formulas 
for  detection  with  recognition  of  m  possible  signals: 

a)  for  precisely-known  signals  /170 


i 

I 


Q_ 


ln/n+  in  -i - 1,4  -f 

Pn  t 


* 


b)  for  random-phase  (equally  probable)  signals 


Q_ 

N* 


Inm  +  ln  —+  I /  ln-i - 1,4  )  ; 

Pit  t  I  Pnp  / 


c)  for  fluctuating  signals 


(11.27) 


(11.28) 


(11.29) 


These  formulas  make  it  possible  to  determine  energy  Q  (or  Qcp  )  of  each  signal 
U|((t)  required  for  detection  with  recognition  of  m  possible  signals  with  given 
error  probabilities  Pn  r  and  Pnp  . 

Since  values  Pn  1  and  Pn 9  are  identical  for  optimum  and  quasi-optimum  systems, 
then  formulas  (11.27)— (11.29)  in  equal  measure  are  valid  for  both  systems. 


The  difference  between  optimum  and  quasi-optimum  systems,  as  was  demonstrated 
in  §  11.2,  consists  only  of  the  fact  that,  for  given  magnitudes  Pn  T  and 

Pnp  >  the  1'istortion  probability  in  a  quasi-optimum  system  is  determined  from 
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formula  (11.24),  while  distortion  probability  /’■««  satisfies  inequalities  (11.6) 
and  (11.20),  i.  e.,  range  from 


Pap  ^  Puea  ^  ^np  +  ^  m  j  P«  t- 

From  formulas  (11.24)  and  (11.30)  we  have 


1< 


I  — 


J_\  Pg  r 
m  Pap 


(11.30) 


(11.31) 


It  follows  from  these  relationships  that  the  difference  between  the  distortion 
probabilities  in  quasi- optimum  and  optimum  systems  is  less,  the  less  ratio  Pg  ^Pnp> 
and,  given  Pa  JPa p^O.l,  it  is  possible  to  assume: 


(11.32) 


This  result  is  very  understandable.  Actually,  it  was  demonstrated  in  §  11.2 
that  resultant  /’■«  is  greater  than  Puen  only  because,  in  a  quasi-optimum  system, 
obtaining  the  correct  response  when  signal  uk(t)  is  present  requires,  along  with 

everything  else,  that  output  voltages  of  all  receivers  /7, . I7m  ,  with  the  exception 

of  /7ft,  not  exceed  threshold  UQ  (Figure  11.6).  In  the  optimum  system  (Firjure 
11.1),  there  is  no  requirement  that  this  condition  be  met  in  order  to  obtain  the 
correct  response.  Hence,  it  follows  that  the  shortcoming  of  a  quasi-optimum  system 
compared  with  an  optimum  system  (i.  e.,  the  />m  overrun  over  />M„)  must  be  greater, 
the  lower  threshold  UQ.  Where  Uo““*  oo ,  this  shortcoming  asymptotically  /171 
disappears,  but,  when  UQ  *  0,  the  qua si- optimum  system  will  become  completely 
useless.  Actually,  if  UQ  a  0  is  selected,  then  a  UQ  threshold  overrun  essentially 
always  occurs  in  all  Figure  11.6  system  channels  (i.  e.,  dips  in  all  indicators 
1^,  .  .  .,  Im)  and,  as  a  result,  it  almost  always  is  impossible  to  indicate  which 
possible  signal  u^  is  present  at  input. 


Thus,  the  lower  UQ,  the  greater  ratio  P'atafPmea  must  be.  But,  the  lower  Uq, 
the  greater  ratio  PmyfPmv  •  Actually,  if  UQ  a  0,  then  false  alarms  always  occur 
in  the  system  and  there  are  no  misses,  i.  a.,  P„  r!Pnv-*.  «>.  If  UQ — *  oef  then 
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there  always  are  misses  in  the  system  and  there  are  no  false  alarms,  i.  e., 

Pa  vJPnp -*■  0  .  Consequently,  when  bias  UQ  changes  from  infinity  to  zero,  ratio 
Pa  t/Pbp  rises  from  zero  to  infinity  and,  with  UQ  inspection,  it  is  possible 
to  obtain  any  ratio  P.i  ^Pap  value. 

Thus,  it  follows  from  a  purely  qualitative  examination  that  an  increase  in 
ratio  Pa  T^apmeans  that  ratio  PteJPnc*  also  must  increase.  Inequality  (11.31) 
provides  quantitative  confirmation  of  this  result. 

The  ratios  presented  in  this  section  make  it  possible  very  easily  to  determine 
requisite  signal  energy  for  both  quasi-optimum  and  optimum  systems. 

Actually,  knowing  the  number  m  of  possible  signal  values  and  having  been 
given  permissible  error  probabilities  P*  ,  and  P„p  ,  it  is  possible  to  use  a  simple 
binary  detection  formula  for  the  appropriate  signal  type  Cone  of  the  (11.27)— (11.29) 
formulas,  for  instance]  to  determine  requisite  signal  energy  Q  (or  Qep).  Given 
such  energy,  distortion  probability  is  determined  in  a  quasi-optimum  system  from 
formula  (11.24)  and  from  inequality  (11.30)  for  an  optimum  system.  Inequality 
(11.30)  in  the  general  case  does  not  permit  determination  of  a  precise  distortion 
probability  value  in  an  optimum  system  since  it  indicates  only  the  limits  in  which 
the  value  is  bounded.  However,  in  many  practical  instances,  knowledge  of  these 
boundaries  turns  out  to  be  fully  sufficient.  Thus,  for  example,  it  follows  from 
formula  (11.30)  that,  where  Pn  T^0.l/>np,  it  is  possible  with  less  than  10%  error 
to  assume: 


^■e«  *  P*t*  *■  P^p  +  ^  I  —  ~  ]  Pa  t- 


(11.32a) 


There  is  absolutely  no  requirement  for  precise  knowledge  of  magnitude  PneH 
for  several  signal  types  and,  when  determining  requisite  signal  energy,  we  will 
assume  a  variation  in  magnitude  Pn,n  selection  by  a  factor  of  several  unities  and 
more.  In  particular,  this  postulation  occurs  in  the  case  of  precisely-known  and 
random-phase  signals.  Actually,  formulas  (11.27)  and  (11.28)  are  valid  for  these 
signal  types  and  it  is  possible  to  assume  (where  P„  T<  0,1  and  pnp  <  0,1  )  with 
slight  error: 


T.~(l  lBm+,*,-; +|/  In— J 


We  will  assume 


(11.33) 
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—  -• 
Pup 


(11.34) 


where  —  some  random  number.  Then,  it  follows  from  (11.30)  that 


(11.35) 


Using  inequality  (11.35)  for  magnitude  Pnc*  determination,  let  us  make  the 
maximum  possible  error,  i.  e.,  instead  of  assuming 


Pfien  ^np» 


(11.36a) 


we  assume 


+  “■)]P*r  (11.36b) 

We  will  clarify  how  this  will  be  reflected  in  requisite  signal  energy.  In 
the  first  case,  from  (11.33),  (11.34),  and  (11.36a)  we  obtain: 


lnm  +  In 


In  the  second  case,  from  formulas  (11.33),  (11.34),  and  (11.36b)  we  have 


Consequently, 
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(11.37) 


It  was  assumed  during  derivation  of  this  formula  that,  in  accordar,  e  with 
(11.34) 


\P IIP  P.i  T* 


Since  l\,  r  ^  1  is  always  the  case,  then  T^np<I  must  be  true.  Considering  formula 
(11.36a),  we  have 
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(11.38) 


Consequently,  formula  (11.37)  is  valid  only  for  those  parameter  'J'  /173 

values  that  satisfy  condition  (11.38). 


Formula  (11.37)  demonstrates  the  factor  by  which  requisite  signal  energy 
changes  if  the  maximum  possible  error  is  made  when  determining  distortion  probability 
i.  e.,  the  upper  rather  than  lower  limit  is  selected  in  inequality  (11.35). 

It  f olio wa  from  formula  (11.37)  that,  the  smaller  pexmissible  distortion 
probability  ,  the  less  the  difference  between  Qj  and  Q^,  i.  e.,  the  less  impact 
the  error  has  on  magnitude  determination.  The  Figure  11.7  curves  are  based 
an  formula  (11.37).  It  is  evident  from  the  curves  that,  where  Pmen  -  0.1  and 
m  =  2,  the  error  in  requisite  energy  determination  does  not  exceed  1  S 
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Figure  11.7 


(««'•»)•  if  /<  1.8.  In  a  case  more  typical  for  radar  (m  =  lo\ 

Pwen  s  10"2) ,  the  error  does  not  exceed  1  di  where  6.  It  is  evident 

from  these  examples  that  it  is  possible  to  use  relationship  (11.32a)  for  precisely- 
known  and  for  random-phase  signals,  i.  e.,  to  assume 

not  only  where  >;  |  .  but  .1M  where  or  wm  where  <iT-  r  . 

P»9  r»»  P<W 

Thus,  for  many  esses  interesting  free  e  practical  point  of  view,  the  energy 
required  in  a  quesi-optimua  system  to  provide  given  error  probabilities  is  approx¬ 
imately  the  same  as  that  in  an  optimum  system  and,  consequently,  it  is  possible 
to  use  the  same  computational  formulas  for  both  ay sterna. 

As  alrea<V  noted  in  $  11.2,  the  error  probeblitias  in  the  case  of  an  optimum 
m-chaimel  system  (Figure  11.5)  are  the  same  as  in  the  case  of  an  optimum  system 
of  detection  with  recognition  of  m-orthogonal  signals  (Figure  11.1).  Therefore, 
computational  formulations  presented  above,  formulas  (11.27) — (11.32)  in  particular, 
are  valid  for  an  optimum  m-channel  system  as  wall  (Figure  11.5). 

In  conclusion,  it  should  be  underscored  that  it  was  assumed  throughout  this 
section  that,  in  the  case  of  m  possible  signals  u^(t),  .  .  .,  uffl(t)  at  system 
input  (Figure  11.1),  only  one  of  these  signals  nay  be  present  during  each  sequence. 


This  is  equivalent  in  the  case  of  an  m-channel  system  (Figure  11.5)  to  the  fact 
that  a  signal  may  be  present  only  in  one  of  its  channels. 

In  a  more  complex  case  where  signals  may  be  present  simultaneously  in  several 
channels,  results  may  differ  significantly.  For  instance,  in  an  m-channel  system 
(Figure  11.5)  let  signals  appear  independently  in  any  number  of  channels  of  this 
system.  This  signifies  that,  if  it  becomes  known,  for  example,  that  there  /174 
is  a  signal  at  input  of  the  k-th  channel,  then  this  will  not  impact  in  any  way 
on  the  probability  of  signal  presence  or  absence  in  the  remaining  channels.  Since 
noise  in  different  channels  also  is  assumed  to  be  statistically  independent,  then 
information  on  the  nature  of  the  voltages  in  the  remaining  channels  plays  no  role 
whatsoever  in  solving  the  problem  of  whether  or  not  a  signal  is  present  in  a  given 
channel. 

Consequently,  optimum  solution  of  the  overall  problem  requires  each  channel 
to  provide  an  independently -optimum  solution  of  the  simple  binary  detection  problem, 
i.  e.,  to  operate  in  the  same  mode,  as  occurs  in  the  Figure  11.6  system.  This 
denotes  that,  for  the  given  conditions,  the  Figure  11.6  system  is  converted  from 
a  quasi-optimum  into  an  optimum  system  (the  assumption  here  is  that  there  are 
signals  in  those  channels  where  threshold  (JQ  is  exceeded,  i.  e. ,  indicators  1^ 
dip,  and  there  are  no  signals  in  the  remaining  channels).  The  Figure  11.1  system 
is  based  on  selection  of  the  greatest  voltage  and,  for  the  given  conditions, 
is  unacceptable. 

11.4  Signal  Detection  With  Recognition  Compared  To  Complex  Binary  Detection 

We  will  compare  signal  energy  required  for  detection  with  recognition  of 
m  possible  non-zero  signals  to  the  energy  required  in  complex  binary  detection 
of  such  signals. 

Since,  in  the  second  case,  the  requirement  is  only  signal  detection,  without 
recognition  of  exactly  which  of  the  signals  is  present  (if  a  signal  is  present), 
then  a  smaller  volume  of  information  will  be  subject  to  reproduction  (reception) 
than  in  the  first  case.  Therefore,  solution  of  such  a  problem,  all  other  condition8 
being  equal,  requires  less  signal  energy  than  for  simultaneous  signal  detection 
with  recognition.  We  will  clarify  whet  savings  in  signal  energy  may  be  obtained 
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We  will  assume  for  comparison  of  detection  to  simultaneous  detection  and 
recognition  that,  in  both  cases,  identical  detection  error  probabilities  P*  » 
and  ^np  are  permissible.  Here,  sicpial  recognition  distortion  probability 
Pme*  is  determined  from  relationship  (11.30). 

Then,  fi  .  formulas  (10.16),  (10.18),  (11.27),  and  (11.28),  we  will  obtain 

a)  for  precisely -known  signals 


9  M+|/ 

(|/  ,n  M  +  \/  ,n —  * *4)  +0,5,n'B 

b)  for  random- phase  signals 


(11.40) 


Q_ 

Q' 


W  +0-‘te" 


(11.41) 


Ratio  Q/Q*  demonstrates  the  factor  by  which  requisite  signal  energy  increases 
during  the  transition  from  detection  to  simultaneous  detection  and  recognition. 

To  simplify  the  analysis,  initially  we  will  accept  that  the  follwing  rela¬ 
tionships  (which  usually  are  the  case  in  practice)  are  satisfied: 

and  In -J- +  In m  ^  In— .  (11.42) 

P*i  Pa  T  P|»P 


Then,  relationships  (11.40)  and  (11.41)  coincide  and  take  the  following  form: 


kKMsucfeisy 


Consequently!  the  result  for  any  ratio  -niVJ?*_Tl  i8 


i.  e.f  discontinuance  of  recognition  provides  a  signal  energy  savings  of  less 
than  3  d8. 


If,  instead  of  (11*42),  one  assumes 


'"T — 

Pm t 


(11.44) 


then,  relationships  (11.40)  and  (11.41)  take  the  form 


q  |  4+to-H/nT 
Q'  «+* 


(11.45) 


where 


Ml/W 


Here,  the  result  again  is 


Finally,  if  one  assumes  in  addition  to  condition  (11.42)  [or  to  (11.44)] 


In  — i—  >  Inm, 

Pat 


(11.46) 


than,  from  (11.43)  [or  from  (11.45)],  we  will  obtain  Q/Q*  *  1. 


Thus,  the  following  basic  results  are  obtained  for  precisely-known  and  random- 
phase  signals: 

1.  In  the  majority  of  cases  of  practical  interest  [i.  e.,  when  condition 
(11.42)  or  (11.44)  is  met],  discontinuance  of  signal  recognition  provides  a  savings 
in  requisite  signal  energy  less  by  a  factor  of  2. 

2.  The  result  for  a  sufficiently-slight  false-alarm  probability  [and,  precisely 
when  condition  (11.44)  is  met]  is  Q  Q',  i.  e.,  discontinuance  of  signal  recognition 
does  not  provide  any  savings  in  requisite  signal  energy. 

The  second  result,  as  L.  R.  Dobrushin  pointed  out  [79],  occurs  for  fluctuating 
signals  as  well,  i.  e.,  in  the  case  of  fluctuating  signals,  it  also  is  possible 
to  assume  that  Q  «*  Q’  if  false-alarm  probability  Pn  T  is  sufficiently  slight. 


V. 
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CHAPTER  TWELVE  /177 

SPECIAL  FEATURES  OF  PULSE  TRAIN  (MPACKET")  DETECTION  AM)  RECOGNITION 
12.1  General  Nature  of  Pulse  Trains 

Let  possible  signal  u  (t)  be  a  train  of  n  pulses,  as  depicted  in  Figure  12.1. 

C 

For  brevity,  this  signal  u8ually  is  referred  to  as  a  pulse  packet. 

Each  pulse  u^(t)  of  such  a  packet  completely  is  characterized  by  amplitude 
a^,  frequency  f^,  moment  of  an8et  t^,  initial  phase  at  moment  of  onset  cj^,  and 


Figure  12.1 


duration  T ^  If  the  relationshipe  between  the  parameters  of  all  packet  pulses 
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are  precisely  known  at  the  point  of  reception,  then  such  pulses  and  such  a  pulse 
packet  are  called  coherent.*  Otherwise,  the  packet  is  called  incoherent.  It 
follows  from  determination  of  a  coherent  packet  that,  if  even  one  of  the  packet 
pulses  becomes  known  at  the  point  of  reception,  for  example  parameters  a^,  f , , 
and  T ^  of  the  first  pulse,  then  entire  packet  u  (t)  thereby  will  become 

known. 

Consequently,  a  coherent  pulse  packet  has  exactly  the  same  maximum  possible 
number  of  unknown  parameters  as  does  the  signal,  which  comprises  one  pulse.  This 
circumstance  makes  it  possible  in  a  number  of  cases  very  simply  to  find  the  formula 
for  the  packet  signal  from  the  corresponding  formulas  for  a  monopulse  signal. 

The  maximum  possible  number  of  unknown  parameters  in  an  incoherent  pulse 
packet  (of  the  type  depicted  in  Figure  12.1)  is  very  large  and  equals  5n. 

The  special  features  arising  during  binary  detection  (simple  and  complex) 
and  during  detection  with  recognition  of  various  packet  types  are  examined  in 
this  chapter. 

12.2  Precisely-Known  Packet  Detection  and  Recognition  /178 

A  packet  is  precisely  known  if  the  type  of  packet  signal  u  (t)  is  precisely 
known.  The  only  unknown  subject  to  determination  in  this  case  is  whether  or  not 
a  packet  is  present.  Evidently,  in  order  for  packet  u (t)  to  be  precisely  known, 

C 

it  must  be  not  only  coherent,  but  all  parameters  of  one  of  this  packet's  pulses, 
such  as  a^f  fp  ^2*  and  also  must  be  precisely  known. 

A  precisely-known  packet  is  a  particular  precisely-known  signal  and  all  the 
formulas  presented  above  for  detection  (simple  and  complex),  recognition,  and 
simultaneous  detection  and  recognition  of  precisely-known  signals  presented  above 
are  valid  for  it  since,  during  its  derivation,  signal  uc(t)  was  written  in  a  general 
form,  i.  e.,  without  cone ret izat ion  of  its  shape. 


*f  rom  the  word  stseolenive— coherence. 
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In  particular,  all  Chapter  5  formulas  and  formulas  (10.12)  and  (11.27)  apply 
here.  Signal  uc(t)  energy  Q  included  in  them  equals 

Q=J«e*(0<«;  (12.1) 

o 

therefore,  as  follows  from  Figure  12.1, 

n 

(12.1a) 

where  Q.  —  energy  of  the  i-th  pulse. 

Consequently,  all  the  aforementioned  error  probability  formulas  remain  valid 
for  precisely -known  packets  if  Q  in  them  is  understood  to  mean  the  energy  of  the 
entire  pulse  packet  equal  to  the  sum  of  the  energies  of  all  n  packet  pulses  [formula 
(12.1a)].  Hence,  it  follows  that,  when  a  precisely-known  analog  signal  is  split 
into  a  series  of  precisely-known  pulses,  signal  detection  and  recognition  errors 
in  all  the  specific  cases  examined  above  remain  unchanged  if,  during  the  split, 
total  signal  energy  remains  unchanged.  In  other  words,  for  simple  binary  detection, 
and  in  a  case  of  orthogonal  signals  also  for  complex  binary  detection,  recognition 
(without  detection),  and  simultaneous  detection  and  recognition,  the  total  requisite 
energy  Q  of  each  signal  u^(t)  does  not  change  when  an  analog  (monopulse)  signal 
is  split  into  a  series  of  precisely-known  pulses.  However,  such  splitting  complicates 
signal  shape,  with  the  result  that  the  structure  of  the  optimum  receiver  (correlator 
or  optimum  linear  filter  structure)  is  complicated  accordingly.  Therefore,  if 
a  system  designer  has  the  capability  to  select  signal  shape,  then  splitting  it 
in  the  aforementioned  cases  into  a  series  of  pulses  must  be  validated  by  some 
other  significant  advantages.  In  radar,  for  example,  one  advantage  of  splitting 
a  monopulse  sinusoidal  signal  into  a  series  of  pulses  is  the  increase  in  system 
range  resolution  due  to  the  reduction  in  the  duration  of  each  elementary  /179 
pulse.  In  addition,  the  conclusion  made  above  concerning  requisite  signal  energy 
invariability  will  apply  to  noise  in  the  form  of  normal  white  noise  and  may  not 
be  valid  for  other  types  of  noise. 


12.3  Inverse  Probability  of  a  Random-Amplitude  Random-Phase  Packet 


Let  packet  uc(t)  be  precisely  known,  with  the  exception  of  amplitude  a^  and 
phase  cj)/*.  Parameters  and  during  each  sequence  [observation  cycle 

(0,  T)]  are  considered  constant  and,  from  one  sequence  to  another,  change  like 
analog  random  magnitudes  with  n-dimensional  a  priori  distributions  P(a^,  .  . 
an)  and  P(  (j)  <pn),  respectively.  Conseauently ,  in  this  case,  signal 

u  (t)  has  2n  random  parameters  and,  in  accordance  with  general  formulas  (8.11) — 
(8.14) ,  its  inverse  probability  may  be  written  in  the  form 


P y  (fljl  •••«  flj'i  fn  ••••  Vn) 

=  (a„ ....  a„)  P  (<P„ ...  f„)  [y  (/)  — ii e  (01. 


(12.2) 


where 


"e(0=»Me(ai . <V,  t ). 

It  is  accepted  here  that  there  is  no  statistical  dependence  between  amplitude 
and  phase  changes. 

In  addition,  we  will  accept  that  the  limits  of  change  of  all  phases 
are  identical  and  equal  0  —  2  7T  .  Then,  based  on  (8.11),  it  is  possible  to  write 


an  an 

Py  (a . an)  =*k,  P  (o, . a„)  j  ...  f  P  (<p  „  ... .  y„)  x 

x  UP.d  ly  (0 — «c(0)rfVi  .  rf'Pn. 


(12.3) 


♦If  moments  ^  of  pulse  onset  are  not  known  precisely,  but  with  errors  At^ 

slight  in  comparison  with  pulse  durations  and  if  a  large  number  of  sinusoid 

periods  (f^  X  ^^^1)  will  be  contained  in  each  pulse,  then  it  is  possible  to 

assume  that  moments  t^  are  precisely  known  and  only  phases  are  unknown  and 
random.  1 

In  other  words,  in  this  case,  the  ambiguity  of  pulse  onset  moments  is  considered 
sufficiently  completely  by  the  fact  that  initial  phases  are  considered  ambiguous. 
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For  noiS8  in  the  form  of  normal  white  noise,  in  accordance  with  (1.25),  we 


Pw(at, ,  a„)  ~kt  /’(«, . an) |  ...  |  P(t p,....,<pn)  X 


7*  w 

x  e*P  ~7f;| 1*  <*>“"«  d«p ,  ...d«p„. 


Expanding  the  square  brackets  under  the  integral,  we  have 


i  n 

i\\av...t  an)-=  k,P  {a, . an)e~Q,N%  j  -  j  P(«p, . <p„)  X 

0  0 

xexp  —  j !/ (0 lte  (0  dt  dcpi ...  d<p„, 

*  o 

where  Q  is  determined  from  formulas  (12.1)  and  (12.1a),  while  kj  - 
coefficient. 


Further,  we  will  examine  several  more-particular  cases. 


(12.4) 


-  normalizing 


First  case.  Packet  pulses  are  coherent  among  themselves.  This  signifies 
that  there  is  a  composite  statistical  link  between  individual  pulse  parameters 
and  the  n-dimensional  probability  densities  in  formula  (12.4)  may  be  replaced 
by  unidimensional  distributions  for  any  of  the  packet  pulses,  for  the  first  pulse, 
for  example.  In  addition,  we  will  accept  that  frequencies  f  of  all  pulses  are 
identical,  while  distribution  of  phases  is  uniform.  Then,  formula  (12.4)  may 
be  written  in  the  form 


xn 


(12.5) 


where 


"  ‘ifi 


<p,)  r=_  V  r  y  (/)  (a,  +  Aa,)  cos  (ro<  +  <Pt  4-  A<P()  d<p,; 

/v*  i  i  », 

Art,  -  a,  -  -rt,;  A»p,  -n  «p,  —  ip,. 


Since  the  pulse  packet  is  coherent,  the  and  A  ace  precisely-known 
magnitudes. 

Second  case.  Packet  pulse  amplitudes  are  linked  among  themselves  by  a 
precisely-known  dependence,  while  phases  ^  ,  are  statistically  independent  and 
distributed  uniformly  (ranging  from  0  to  2  TC  ). 

In  this  case,  it  is  possible  in  formula  (12.4)  to  replace  Py(a^,  •  •  *»  an 
and  P(a^,  .  .  .,  an)  by  PyCa^)  and  P(a^),  respectively  and  also  to  assume 

Hfi . <P„) 

then  formula  (12.4)  takes  the  form 

P„(al)=k,P(al)e  0  dq», dq>».  (12.6) 

!  A  A 


But 


T  n  *!+■«# 

|  y  (0  «c  (0  dt  —  2  |  y  (0  °t  cos  M  4  q>,)  dt\ 

0  f-l  I, 


therefore 


/ib: 


P9  (a,)  -  A,  P  (at)  e iV'"1'  •«*  dip,, 

f-i  n 


(12 


where 


*l(o„  <p,)«= 


tt  f  y(0  cos  («f -f 


U+t, 


(12 


Comparing  expressions  (9.9)  and  (9.10)  with  (12.7)  and  (12.8),  it  is  not 
difficult  to  become  convinced  that,  in  this  case 


Pu(al)=k,P(at)e-0'N> 


(12.9) 


where 


n  2 

t,:  a‘  ==ai  +  Aa> 

J-1 

(AaA  and  —  precisely -known  magnitudes); 


'i+*i  »i+«, 

i  y(t) cos ottdt;  V,  =»  f  y(t)s\nmtdl. 

»  u 


(12.9a) 


Evidently,  magnitude  IT  plays  the  same  role  as  parameter  M  in  the  case  of 
a  monopulse  system  and  may  be  computed  using  identical  approaches,  using  an  optimum 
filter,  for  example.  Since,  during  computation  of  tV  for  various  number  i,  the 
only  thing  that  changes  are  integration  limits,  which  do  not  overlap  among  themselves, 
then  all  PL  magnitudes  may  be  obtained  with  the  aid  of  an  optimum  filter  matched 
only  with  first  pulse  u^(t)  by  means  of  the  corresponding  time  shifts. 


Consequently,  when  using  formula  (12.9)  for  inverse  probability  computation 
in  this  case,  it  suffices  to  have  an  optimum  filter  matched  with  single  pulse 
u^(t)  and  the  appropriate  delay  circuits. 

It  usually  is  more  convenient  to  compute  In  Py(a^)  rather  than  Py(a1);  here, 
instead  of  formula  (12.9)  (assuming  kg  s  1),  we  have 


(12.10) 


Third  case.  Individual  packet  pulse  amplitudes  and  phases  statistically 
are  independent;  phases  are  distributed  uniformly  (ranging  from  0  up  to  2  Tt  )• 

In  this  case,  in  formula  (12.4)  it  is  possible  to  assume 

P(<»i . on)  —P(at) ...  P{an). 


Otherwise,  this  case  does  not  differ  from  the  preceding  one.  Thus, 
instead  of  (12.9),  it  is  possible  immediately  to  write: 
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P,  *  fl  P  <».)  ^  I.  ( ^  ) , 

,-l  V  ■' 


(12.11) 


where  is  determined  from  the  same  formulas  as  in  the  preceding  case. 
12.4  Detection  and  Recognition  of  Coherent  Random  Initial  Phase  Packets 


In  this  case,  the  analysis  may  be  based  on  expression  (12.5)  for  inverse 
probability.  However,  the  basic  relationships  for  a  signal  of  this  type  already 
were  presented  in  preceding  chapters. 


Actually,  since  all  packet  amplitudes  are  coherent  among  themselves,  possible 
signal  u^t)  [signal  u£(t)  in  Figure  12.1]  will  fall  into  the  formula  (9.50)  class 
in  which  functions  a(t)  and  (t)  are  precisely  known  and  describe  the  packet 
pulse  amplitude  and  intitial  phase  change  when  this  pulse's  number  i  changes. 
Therefore,  all  formulas  presented  in  preceding  sections  for  a  signal  with  known 
amplitude  and  random  equiprobable  phase  are  valid  for  error  probabilities  if  energy 
Q  in  these  formulas  is  understood  to  mean  the  energy  of  a  packet  of  n  pulses, 
determined  from  formula  (12.1).  Thus,  for  example,  formula  (11.33)  is  valid  for 
detection  with  recogiition  of  equiprobable  orthogonal  packet  signals  with  equal 
energies,  to  wit 


Q_ 


(j/WlnJL+l/ 


where  Q  s  nQ'  —  packet  energy. 


(11.33) 


i 
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The  structure  of  optimum  receiver  77*  included  in  the  detection  system  is 
determined  from  formule  (9.51).  Psrameter  included  in  this  formuls  may  be 
computed,  as  noted  above,  using  an  optimum  filter  and  amplitude  detector  or  by 
the  Figure  9.1  computational  device.  In  the  latter  case,  cos  U  t  and  sin  U  t 
everywhere  should  be  replaced  by  a(t)  cos  [L/t  +  ^(t)]  and 
a(t)  sin  [  U  t  +  ^/(t)3,  respectively. 

12.5  Coherent  Fluctuating  Packet  Detection  and  Recognition 

A  coherent  packet  is  referred  to  as  fluctuating  if  amplitude  a^  and  phase 
from  one  sequence  to  another  change  as  analog  random  magnitudes  with  a  priori 
distributions  WU^)  and  P(  ^  j)#  respectively.  (Since  the  packet  is  coherent, 
then  fluctuations  in  first  pulse  amplitude  and  phase  fully  determine  the  /183 
fluctuation  of  the  amplitudes  and  phases  of  all  subsequent  packet  pulses). 

Here 


where  P(C)  —  a  priori  distribution  of  packet  presence.  In  this  case,  the  analysis 
also  may  be  performed  based  on  expression  (12.5). 


However,  given  certain  assumptions,  results  may  be  obtained  directly  from 
formulas  presented  in  preceding  chapters.  Thus,  we  will  assume  that  packet  pulses 
have  identical  amplitudee  and  phases  (a^  a  aQ,  f^  a  fQ)  and  amplitude  is  distributed 
based  on  Rayleigh's  law,  while  phase  is  equally  probable  [amplitude  and  phase 
are  unchanged  within  each  observation  cycle  0,  T)].  Then,  error  probabilities 
are  determined  from  the  formulas  presented  in  the  preceding  chapter  for  a  fluctuating 
signal.  Here,  energy  Q«p  should  be  understood  to  mean  the  average  energy  of  the 
entire  packet,  i.  e., 


I 


fl 


where  Q„p ,  —  o,*r,/2  —  average  energy  of  the  i-th  packet  pulse. 

I 


(12.13) 


262 


.Tj,  _..A  .A  _%,  . 


If  all  packet  pulae8  have  identical  duration  £  ,  then 


'rtQepI. 


(12.13a) 


where  Qcp ,  —  average  energy  of  one  packet  pulse.  Thus,  for  example,  during 
fluctuating  packet  detection  with  recognition,  formula  (11.29)  is  valid  if,  in 
it,  Qcp  is  understood  to  mean  average  packet  energy  determined  from  formula  (12.13). 

12.6  Detection  and  Recognition  of  Incoherent  Packets  With  Known  Amplitudes  and 
Independent  Random  Phases 


If  packet  pulse  amplitudes  are  interlinked  by  a  precisely -known  relationship 
and  phases  are  statistically  independent  and  uniformly  distributed  (ranging 
from  0  to  2  7T  ),  then  the  second  case  examined  in  §  12.3  occurs  here  and  analysis 
may  be  performed  based  on  formula  (12.10)  for  packet  inverse  probability. 


Initially,  we  will  examine  simple  binary  detection  of  such  a  packet.  Here, 
it  should  be  assumed  in  formula  (12.10)  that  amplitude  a^  either  equals  precisely- 
known  magnitude  a^  (if  a  packet  is  present),  or  is  identical  with  zero  (if  no 
packet  is  present).  Therefore,  a  response  to  the  question  of  whether  or  not  a 
packet  is  present  requires  comparison  among  themselves  of  inverse  probabilities 
(or  their  logarithms)  Py(C)  and  Py(0)  packet  presence  or  absence,  respectively, 
wta*?e,  in  accordance  with  formula  (12.10) 


In  P, <Q  -In  P,  (*,)-  In P(C)-  £  +  V  l»  U  ( ! 

N*  St  V  "• 


(12.14) 


lnPr(0)  —  InP(O),  I 

and  P(C)  s  P(aQ^)  is  the  a  priori  probeblity  of  packet  presence. 

It  follows  from  (10.5)  and  (12.14)  that  the  decision  must  be  that  a  /184 
signal  is  present  if  the  following  condition  is  met 


(12.15) 


IS  ffiy wl  ^ 


where 


(12.16) 


where  7f  —  weight  fector  considering  the  relative  danger  of  false  alarms  and 
signal  misses. 


If  errors  of  both  types  are  equally  dangerous,  i.  e.,  minimum  composite  error 
probablity  must  be  insured,  then  it  should  be  assumed 

n- 1 

If  condition  (12.15)  is  not  met,  then  the  decision  is  made  that  no  packet 
is  present. 

As  noted  above,  parameter  may  be  computed  (given  certain  restrictions 
that  usually  occur)  by  an  optimum  linear  filter  matched  with  pulse  u^(t)  and  by 
an  amplitude  detector. 

Since  pulses  u^(t)  in  the  general  case  differ  not  only  with  respect  to  amplitude 
and  time  of  action,  but  also  with  respect  to  carrier  frequency  f^,  then  computation 


</>  _3Z 


XpOHUjamop 


V 


Figure  12.2.  (a)  —  OF  [optimum  filter]}  (b)  —  D  [detector]; 

(c)  —  TsZ  [delay  circuit];  (d)  —  Timer. 

of  the  sum  included  in  inequality  (12.15)  requires  n  individual  filters  or  one 
filter,  but  with  parameters  changing  from  pulse  to  pulse.  In  the  latter  case, 
the  structural  schematic  of  the  optimum  binary  detection  system  has  the  form  depicted 
in  Figure  12.2.  The  timer  insures  optimum  filter  connection  only  for  time  intervals 
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corresponding  to  the  time  of  action  of  the  anticipated  pulses,  and  also  insures 
a  change  in  filter  amplification  and  tuning  frequency  from  r'lse  to  pulse  in 
accordance  with  the  changes  in  amplitudes  and  frequencies .  f;  of  the  individual 
packet  pulses.  If  the  anticipated  pulses  have  identical  amplitudes  and  frequencies 
the  requirement  for  the  latter  two  operations  is  eliminated. 

Amplitude  detector  ft  with  an  In  IQ(x)  type  response  curve,  delay  circuit 
U.3  ,  adder  I!  ,  and  threshold  bias  UQ  are  connected  beyond  the  optimum  filter. 
When  threshold  UQ  is  exceeded,  i.  e.,  where  “»u« >  0  ,  the  response  is  that  a 
packet  is  present.  Otherwise,  the  decision  is  that  no  packet  is  present. 

For  binary  detection  of  a  monopulse  signal  (n  =  1),  the  response  curve  /185 
shape,  as  noted  in  §  11.1,  may  be  replaced  by  any  monotonic  curve,  with  appropriate 
threshold  bias  UQ  adjustment.  In  the  given  case  (where  n  i  1),  this  postulation, 
strictly  speaking,  is  invalid  since  the  adding  operation  will  occur  prior  to  com¬ 
parison  with  the  threshold.  However,  even  in  this  case,  choice  of  the  response 
curve  shape  is  not  very  critical,  especially  if  the  number  of  pulses  in  the  packet 
is  small. 

The  probability  of  satisfying  inequality  (12.15)  when  there  is  and  is  not 
a  signal  present  should  be  computed  to  find  error  probabilities.  In  the  general 
case,  solution  of  this  problem  encounters  significant  mathematical  difficulties. 
Therefore,  several  assumptions  have  to  be  made. 

First,  we  will  assume  that  n  ^^1.  Here,  we  may  assume  that  the  law  of 
distribution  of  the  sum  at  the  left  side  of  inequality  (12.15)  is  normal  since 
the  terms  making  up  this  sum  are  statistically  independent. 


We  will  designate 


where 


! 


*1  = 


'"'•(iSru> 


(12.18) 


then,  in  light  of  the  assumption  made,  it  is  possible  to  write  the  law  of  magnitude 
y  distribution  in  the  form 


/><!!)-  -<»-SW>V,  (12.19) 

where  ~j  and  <V  —  magnitude  y  mean  value  and  dispersion. 

We  will  compute  parameter  y  and  <%*  values  when  a  signal  is  and  is  not 
present.  It  follows  from  (12.17)  that 


»-%}'■  ".‘-i":,  U2.20) 

where  *i  and  <rf|*  —  term  mean  value  and  dispersion.  Thus,  it  suffices  for 
finding  distribution  P(y)  to  compute  random  magnitude  mean  value  and  dispersion 
determined  from  formula  (12.18). 

We  will  designate 


then 


i  . 

N.  ' 


=  In  !,(*,). 


(12.21) 


(12.22) 


We  will  assume  that 


x,  -  I; 


/186 

(12.22a) 
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then,  considering  the  properties  of  function  we  will  obtain 


Consequently , 


(12.23) 


Considering  (9.13),  the  result  is 


(%'+¥?)■  (12.24) 

2 a  -ir  2d 

But,  we  found  in  §  9.2  that  magnitudes  X  and  ^  Y  when  there  is  a  signal 
are  independent  random  magnitudes  with  zero  mean  values  and  with  dispersions  equalling 
2Q/Nq.  Consequently,  in  the  case  under  examination,  this  must  be  the  case 


Therefore,  when  there  is  no  signal  (i.  e.,  only  noise),  formula  (12.24)  provides 


(12.25) 

It  is  convenient  to  use  the  following  approach  to  find  dispersion  <>*,  when 
a  signal  is  and  is  not  present,  as  well  as  to  determine  magnitude  77  when  there 
is  a  signal. 


Magnitude  M.,  as  was  demonstrated  above,  is  proportional  to  optimum  linear 


filter  output  voltage  envelope  11^(0  (at  moment  t  =  T).  Therefore,  it  is  possible 
to  write  expression  (12.23)  in  the  form 


*i=AUi,(?), 


(12.26) 


where  A  —  some  constant. 

This  equation  corresponds  to  the  square-law  detector  response  curve  equation. 
Therefore,  may  be  considered  square-law  detector  output  voltage. 

When  there  is  no  signal,  only  noise  with  a  normal  law  of  distribution  arrives 
at  the  input  of  this  detector.  Here 


it  ~AUl-A2Ul, 


where  Um*  —  mean  square  of  the  noise  voltage  at  detector  input. 


(12.27) 


Dispersion  a»*  in  this  case  is  nothing  other  than  the  mean  square  of  the 
noise  voltage  at  square-law  detector  output,  therefore 


AUU*m. 


(12.28) 


When  there  is  a  signal,  detector  output  voltage  is  the  sum  of  normal  /187 
noise  with  extant  value  Um  and  a  harmonic  signal  with  amplitude  Um  „  .  Here, 
the  result  at  square-law  detector  output  is 


i,  -  A2U, 


s('<© 


(12.29) 


(12.30) 


It  follows  from  comparison  of  relationships  (12.25)  and  (12.27)  that 


A2Ul~$L. 


(12.31) 


Ratio  UmvIVm  in  formulas  (12.29)  and  (12.30)  is  the  ratio  of  signal  amplitude 
(at  moment  t  s  T)  to  the  mean  square  noise  voltage  at  optimum  linear  filter  output. 
Therefore,  in  accordance  with  formula  (2.34),  we  have 

<*■  «?» 

U*u  N.  ' 

Substituting  eKpressions  (12.31)  and  (12.32)  into  formulas  (12.27) 
we  obtain: 

a)  when  there  is  no  signal 


(12.32) 

—(12.30), 


b)  when  there  is  a  signal 


(12.33) 


(12.34) 


Further,  we  will  assume  for  simplicity  that  all  packet  pulses  have  identical 
energies,  i.  e., 


Q, 


(12.35) 


where  Q  —  energy  of  the  entire  packet. 

Then,  fi.  irmulas  (12.20),  (12.33),  (12.34),  and  (12.35),  we  will  obtain: 
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a)  when  there  is  no  signal 


Q  .  l  Q* 

*-*s  *~T 'w' 


b)  when  there  is  a  signal 


(12.36) 


(12.37) 


Based  on  the  resultant  data,  it  is  not  difficult  to  compute  false-alarm  and 
signal  miss  probabilities. 

Actually,  it  follows  from  relationships  (12.15)  and  (12.17)  that  the  /188 
false-alarm  probability  is  the  probability  that  this  inequality  is  satisfied 


y>u. 


when  there  is  no  signal,  i.  e., 


-  If  dj,, 

b,  V2nov*  jJt 

where  y  and  ap*  are  determined  by  formulas  (12.36). 


(12.38) 


Signal  miss  probability  is  the  probability  that  this  inequality  is  satisfied 


y<u. 


when  there  is  a  signal,  i.  e., 


where  y  and  or*  are  determined  from  formulas  (12.37). 


(12.39) 
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be  somewhat  different  and  must  be  determined,  not  from  formula  (12.16),  but  from 
the  following  formula: 


P(  0) 
P(C ) 


ff. 


(12.16a) 


As  analysis  shows,  if  ('"L^TqJI  *  then  correction  e  has  the  following 

form: 


e 


We  will  examine  a  case  of  slight  error  probabilities  P„  T  and  . 

It  follows  from  formulas  (12.40)  that  probabilities  P.i  T  and  P„ P  may  be 
slight  only  if  limits  CX  ^  and  CX  ^  are  sufficiently  large;  but,  given  a  suf- 
ficiently-large  lower  limit  CC  ,  it  is  possible  to  use  asymptotic  expansion 
of  the  probability  integral  and  to  assume 


a 


e-«-/2. 


therefore,  instead  of  (12.40),  it  is  possible  to  accept 


P 


JIT 


—Jr—  e -«'’/*• 
V  «i 


P„  e—//*, 

p  I-2.1  a. 


(12.42) 


hence,  we  obtain: 


»2  In— - 1,85—2  In*,; 

Pi it 

«21n-J - 1,85— 2  In  ct,. 


Since  in  this  case  CX  ^  and  CX  ^  are  sufficiently  great,  then  it  is  possible 
in  (12.43)  in  the  first  approximation  to  disregard  terms  2  In  CX  ^  and 
2  In  CX  2  comparison  with  CX  ^  and  CX  £ f  respectively.  The  result  is: 


oe,*  =»  2  III  — ? - 1,85; 

Pji  t 

a,*  at  2  In  — - 1^85. 

P  np 


(12.44) 


However,  as  the  analysis  shows,  it  is  advisible  for  best  approximation,  instead 
of  (12.44),  to  assume 


a,’ «  2  In  — 
fat 

a,!  ^  2tn  — 

Pop 

On  the  other  hand,  solving  equation  (12.41)  for  magnitude  Q/N  ,  we  /190 
obtain 

^  -  ( Y^n  +  a,*)  Y  ( V'Zfr  +  a,1)*  ~  (a,»  _  a,*)  n.  (12.46) 

Formulas  (12.45)  and  (12.46)  make  it  possible  quite  simply  to  compute  the 
packet  energy  Q  required  to  insure  permissible  error  probabilities  P„  T  and 
Pnp  for  a  giver  number  of  pulses  n,* 

As  computations  show,  where  0,1  and  Pnp  <0,1  ,  the  formula  (12.46) 

error  does  not  exceed  0.5  dfi,  compared  to  more-precise  integral  relationships 
(12.40).  However,  it  should  be  remembered  that  both  formula  (12.46)  and  relationships 
(’.2.40)  are  sufficiently  precise  only  where  n  1. 

*It  follows  from  relationships  (12.41),  (12.45),  and  (12.46)  that,  for  given 
error  probabilities  Pm  and  P„p  ,  threshold  UQ  magnitude  must  be  determined 

from  requisite  magnitude  GC^,  i»  e.,  from  formula  *  '“here  Q/Nq 

is  determined  from  formula  (12.46),  while  CX.  and  CX  ,  are  determined  from 
formulas  (12.45).  1  L 


2  .  2 

If  n  is  very  large,  such  as  n  ^Ct  ^  and  n^CX^  or 

n  (a,  4- a,)1,  (12.47) 

where  CC^  and  OC  ^  in  tum  are  magnitudes  greater  than  unity,  then  formula  (12.46) 
takes  the  form 


(o,4-ot)  V'Ti.  (12.48) 

In  addition,  during  derivation  of  formulas  (12.46)  and  (12.48),  the  assumption 
was  made  [see  (12.22a)]  that  x^^l.  We  will  clarify  the  concept  behind  this 
assumption. 

It  follows  from  relationships  (12.23),  (12.33),  and  (12.34)  that,  when  there 
is  no  signal 


while,  when  there  is  a  signal 


consequently,  instead  of  (12.22a),  it  is  possible  to  write  the  following  condition: 


(12.49) 


i.  e.,  the  signal-to-noise  power  ratio  for  each  (individual)  packet  pulse  must 


be  sufficiently  small.  Here,  the  optimum  response  curve  is  a  square-law  curve 
(see  equality  (12.26)]  and  the  conclusions  made  above  are  valid. 

If  condition  (12.49)  is  not  met,  i.  e.,  the  signal-to-noise  ratio  for  every 
packet  pulse  is  not  too  low,  then  formulas  (12.46)  and  (12.48)  will  remain  valid 
if,  as  usual,  one  assumes  that  the  response  curve  is  a  square-law  curve.  However, 
in  this  case,  such  a  curve  already  will  not  be  optimum  since  the  optimum  /191 
characteristic  is  described  by  equation  (12.22)  and,  given  suffiently-large  x^, 
will  strive  towards  straight  line 

(12.50) 

i.  e.,  towards  the  characteristic  of  a  linear  detector. 

Hence,  it  follows  that,  when  condition  (12.49)  is  not  met,  somewhat  better 
results  (lower  Q/Nq)  may  be  obtained  in  the  optimum  receiver  than  is  the  caee 
from  formula  (12.46). 

During  formula  (12.46)  derivation,  we  assumed  that  the  law  of  distribution 
of  the  (12.17)  sum  was  normal,  which  is  valid  only  for  very  large  n  values.  Kaplan 
obtained  more-precise  results  [16].*  He  also  assumed  a  square-law  response  curve, 
i.  e.,  in  accordance  with  (12.17),  he  assumed  that 

(12.51) 

Here,  when  there  is  no  signal,  magnitude  y  has  a  chi-square  distribution 
with  2n  degrees  of  freedom. 

Kaplan  approximated  this  distribution  as  an  expansion  into  a  power  series 
with  respect  to  a  normally -distributed  magnitude,  which,  naturally,  provides  greater 
precision  than  the  above  P(y)  distribution  normalization. 

*Useful  relationships  for  a  case  of  square-law  detection  [with  respect  to  law 
(12.15)]  will  be  found  also  in  the  V.  S.  Sragovich  article  [123]. 


Kaplan  a3sumed  distribution  P(y)  as  normal  for  a  case  of  signal-plus-noise 
activity  since,  in  this  case,  on  the  one  hand,  distribution  P(y)  is  closer  to 
a  normal  distribution  (than  when  there  is  no  signal)  and,  on  the  other  hand,  it 
turns  out  to  be  more  complicated  to  obtain  a  more-precise  approximation  here. 

The  formula  Kaplan  obtained,  using  our  designations,  may  be  reduced  to  the 
following  form: 


(12.52) 


where  CX^  and  CX^  are  determined  by  formulas  (12.45)  (where 
P^sJO.I  and  P„<0,1  ). 

When  n  ^1,  formulas  (12.52)  and  (12.46),  as  could  be  expected,  coincide 
and  boil  down  to  very  simple  expression  (12.48).  However,  if  n  is  not  too  large, 
i.  e. ,  condition  (12.47)  is  not  met,  more-precise  formula  (12.52)  should  be  used. 

However,  this  formula  also  only  is  valid  when  n  values  are  not  too  small 
and  its  error  increases  with  a  decrease  in  n.  In  the  worst  case,  when  /192 
n  s  1,  from  formula  (12.52)  we  obtain 


+  +  l  +  2«,  +  ~  *i*-f  «**. 


(12.53) 


then,  when  in  actuality  n  s  1,  formula  (9.49)  is  valid,  which,  considering  (12.44), 
provides 


It  follows  from  comparing  formulas  (12.53)  and  (12.54)  that,  when  these  con¬ 
ditions  are  met 


^np^O.l  and  /\,T<P„p  (12.55) 

(i.  e.,  when  Ct2^i,65  and  OC  '^,CC  2),  which  usually  is  the  case,  the 
formula  (12.53)  error,  compared  to  formula  (12.54),  does  not  exceed  2  cb.  Hence, 
it  follows  that  formula  (12.52),  precise  where  n^l,  also  provides,  for  any 
slight  changes  in  n,  a  relatively -small  error  not  exceeding  2  dB. 

It  is  possible  to  reduce  this  error  further  if  formula  (12.52)  is  adjusted 
in  the  following  way. 

Instead  of  (12.52),  we  will  assume  that 


(12.56) 


This  expression  differs  from  (12.52)  only  in  additional  term  A,  which  does 
not  depend  on  nj  therefore,  where  n  — ►  formula  (12.56)  coincides  with  (12.52), 

i.  e.,  as  usual  provides  the  correct  result. 

We  will  select  magnitude  A  so  that  formula  (12.56)  will  provide  a  correct 
result  in  another  extreme  case  as  well,  i.  e.,  when  n  =  1.  Then,  when  n  a  1, 
comparing  the  right  sides  of  equalities  (12.54)  and  (12.56),  we  will  find  that 


A  -  ^  +  <*,*  +  a,  +  * ,  j/  H-  2*|  +  }  <*,*  +«,■)• 


Substituting  this  expression  into  (12.56),  we  obtain  the  formula 


(*l  t  'Xj 


+  «»*  + 


-1-1  /i\«,  +  a,  i  i -f- 7=3=  -f-  — - /, 

"  I  »  n  / 

which  we  will  called  the  adjusted  Kaplan  formula. 

It  is  possible  to  use  this  formula  for  any  given  values  of  n  [if  condition  /193 
(12.55)  is  met]. 


Where  n  =  1  and  where  n  >1.  formula  (12.57)  provides  essentially  precise 

results,  while  when  the  values  of  n  are  intermediate,  the  error  a  fortiori  is 
less  than  2  s. 


The  family  of  curves  in  Figure  12.3  was  plotted  from  formulas  (12.57)  and 
(12.45)  for  two  error  probability  values.  Depicted  in  Figure  12.4  for  comparison 
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Figure  12.3.  (a)  —  d>, 


is  dependence  Q,/N  on  n,  where 
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Figure  12.4 


.2 


It  is  evident  from  Figures  12.3  and  12.4  that  an  increase  in  the  number  of 
pulses  n  in  the  packet  means  a  decrease  in  requisite  energy  of  each  pulse, 
but  packet  energy  Q  as  a  whole  increases.  This  denotes  that  division  of  the  analog 
signal  into  a  train  of  incoherent  pulses  is  disadvantage us  from  the  power  standpoint 

The  resultant  loss  in  requisite  energy  is  characterized  by  the  ratio 

’li  =  •  (12.59) 

where  Q'  —  energy  required  for  an  analog  (monopulse)  signal,  i.  e.,  where  n  =  1. 

It  follows  from  (12.57)  and  (12.59)  that 
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3 

(12.60) 

Magnitude  T(  ^  is  called  the  energy  loss  due  to  pulse  incoherence.  If  /194 
pulses  are  coherent  among  themselves,  but  pulses  are  added  without  consideration 
of  their  rf  occupation  phases,  i.  e.,  with  respect  to  law  (12.17),  then  the  same 
loss  in  energy  occurs  as  is  the  case  if  the  pulses  themselves  had,  relative  to 
each  other,  independent  equiprobable  random  phases.  Therefore,  factor  7^  ^ 
often  is  referred  to  also  as  the  energy  loss  due  to  packet  pulse  adding  incoherence. 

The  solid-line  curves  in  Figure  12.5  were  plotted  from  formula  (12.60). 

Since  formula  (12.60)  was  obtained  from  expression  (12.57),  then,  just  as 
is  the  case  for  (12.57),  it  only  is  precise  when  n  =  1  and  when  n  1.  Where 
n  values  are  intermediate,  formula  (12.60)  provides  somewhat-inflated  magnitude 
.  values,  however  the  error  here  a  fortiori  is  less  than  2  d0. 
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Figure  12.5.  (a)  —  < 8. 

The  corresponding  curves  S.  Ye.  Fal'kovich  [5]  obtained  is  a  somewhat  more- 
precise  way  are  plotted  with  dotted  lines  in  Figure  12.5  for  comparison.  Evidently 
the  divergence  among  the  corresponding  curves  here  does  not  exceed  1.5  dB. 

It  follows  from  (12.60)  that,  when  n  1, 


(here  P„ T  <  0. 1  and  P„„  <  0. 1  ) . 

The  following  conclusion  may  be  made  based  on  formulas  (12.60)  and  (12.61) 
and  the  Figure  12.5  curves: 

1.  The  loss  due  to  incoherence  rises  with  a  rise  in  n  and,  given  large  n 
values,  this  increase  is  proportional  to  yH  . 

2.  The  loss  decreases  with  an  increase  in  permissible  error  probabilities 
Pa  t  and  Pup  • 

3.  Where  n  10,  the  loss  is  relatively  small  and,  given  typical  error 
probability  values,  does  not  exceed  1—3  dB. 


On  the  other  hand,  given  large  n  values,  the  loss  may  be  quite  considerable; 
therefore,  if  n^>  1  and  packet  pulses  are  coherent  among  themselves,  then  it 
is  very  significant  that  their  adding  be  coherent. 

The  aforementioned  analysis  applies  to  simple  binary  detection.  The  basic 
results  and  approach  presented  in  the  preceding  chapter  are  fully  applicable  for 
analysis  of  simultaneous  detection  and  recognition  of  m  possible  packets  /195 
u^(t),  .  .  .,  u^t).  In  particular,  in  the  case  of  m  orthogonal  equally-probable 
packets  with  identical  energies  and  given  slight  error  probabilities 
and  S*ap<0,l  ) ,  requisite  packet  energy  Q  is  determined  by  the  simple  binary 
detection  formula  if  you  replace  p»  r  with  P„  rJni in  it. 

\ 

Therefore,  for  detection  with  recognition  of  m  packets  (or  in  the  Figure 
11.5  m-channel  system),  formulas  (12.57),  (12.60),  and  (12.61)  in  particular  are 
valid  if,  in  these  formulas,  Ct  ^  is  determined,  not  from  formula  (12.45),  but 
the  assumption  is  that: 


«»•  *  2  In  ^2 - 2.8  «  2  In  m  +  2  In  J - 2,8. 

p»» 

Here,  for  example,  formula  (12.61)  takes  the  following  form: 


(12.62) 


1/  2ln«  +  2ln^ - 2,8+1  2In~-  —  2,8 

I  "* »  I  "up 


(12.63) 


where  n  ^  1,  pnp<Q’K 


It  is  evident  from  this  formula  that,  when  m  increases,  the  loss  due  to  inco- 


12*7  Detection  and  Recognition  of  Incoherent  Packets  With  Amicably-Fluctuating 
Pulse  Amplitudes 

The  following  two  extreme  types  of  packet  amplitude  fluctuations  are  the 
most  charset  eristic: 


a)  amicable  fluctuations; 

b)  independent  fluctuations 


For  amicable  fluctuations,  the  amplitude  of  pulses  within  a  given  packet 
is  unchanged  or  is  a  known  regular  time  function,  while  it  changes  from  one  packet 
to  another  (i.  e.,  from  one  sequence  to  another)  as  an  analog  random  magnitude 
with  known  a  priori  distribution. 


For  independent  fluctuations,  amplitudes  of  different  packet  pulses  are 
independent  random  magnitudes. 


We  will  examine  amicable  amplitude  fluctuations  in  this  section.  Here,  the 
second  case  examined  in  §  12.3  occurs  here  and  the  inverse  probability  of  presence 
of  a  packet  with  amplitudes  (a^,  .  .  .,  an)  is  determined  by  expression  (12.9): 


where 


-2  ?llL  - 

/*,(«. . a»)=p* («.)=*, />(«.)«  2Ar«  ni.(a-^M. 

i- 1  v  N*  ' 


(12.9) 


d<=aj  +  Aa,; 
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Aa^  and  T\  —  precisely-known  magnitudes: 


P(ai)~P(QW(oty, 


(12.64) 


here  P(C)  —  a  priori  probability  of  the  presence  of  a  packet  with  random  amplitude 
a^  value,  while  W(a^)  —  law  of  distribution  of  packet  pulse  amplitude  a^,  given 
that  a  packet  is  present. 

It  is  convenient  to  represent  expression  (12.9)  in  the  form 


P,  (<*:)  =Kp  («!>  e*P- 


(12.65) 


In  simple  packet  binary  detection,  a  comparison  is  made  of  inverse  probabilities 


Py(C)  and  Py(0)  of  packet  presence  and  absence*  respectively.  The  decision  is 
that  a  packet  is  present  when  this  condition  is  set 

P,<C)>W0),  (12. 


where  7]  —  weight  factor. 

If  condition  (12.66)  is  not  met,  the  decision  is  that  no  packet  is  present. 
Evidently, 


PAC)=  J  p y 

0  (12.6; 

therefore,  considering  (12.64)  and  (12.65),  we  have 


p.<q-*iP(C>  [  r L|  [“^!  + 


da. 


(12.61 


If  the  a  priori  probability  of  packet  absence  equals  P(0),  then 


P„(0)=*,P(0).  (12.6! 

Consequently,  the  decision  must  be  made  as  to  packet  presence  when  this  condition 
is  met 


y>u» 


(12.71 


where 


(12.7 


Further,  for  simplicity,  we  will  assume  that  the  amplitudes  and  durations  /197 
of  all  packet  pulses  are  identical,  i.  e., 

fli  =  o,  =  ...  =  «;  t,  =  xg  =  ...  =  t; 

then,  expression  (12.71)  takes  the  form 


?  na‘x  2  Inl.j  — «.) 

0  =  $lP(<J)e  2(V*e'-> 


(12.73) 


We  will  examine  two  extreme  cases: 


first  case 


Q‘  rx. 


(12.74) 


second  case 


3l  s, 

/V,  *  '• 


(12.75) 


It  was  demonstrated  in  the  preceding  section  that,  in  the  first  case,  it 
is  possible  to  assume: 

.  .  /20/M,  \  I  /2oAf,  \* 


/  2oAlj  \  I 

inl'(-N7!^-r(-R7)  • 


while,  in  the  second  case 


|2oAf|X  2aM, 

]n'9\~W )  ^  ~Fi7 


Therefore,  in  the  first  case,  from  (12.73)  we  obtain 
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y  =  )  W  (a)  e 


2N,  3  KJ 


(12.76 


while,  in  the  second  case 


a>  a1  Tit  3a  y  ^ 

y  =  S«7(a)e~ 

o 


(12.77 


It  follows  from  (12.70)  that,  for  packet  detection,  the  receiver  must  compute 
magnitude  y  and  compare  it  with  threshold  Uq.  But,  it  follows  from  expressions 
(12.76)  and  (12.77)  that,  instead  of  comperision  of  magnitude  y  with  threshold 
UQ,  it  is  possible  in  the  first  case  to  compare  with  some  other  threshold  U  • 
magnitude 


(12.78 


while,  in  the  second  case,  magnitude 


-  2  M,. 

i- 1 


(12.79 


This  possibility  results  from  the  fact  that,  as  follows  from  (12.76)  and 
(12.77),  there  is  a  regular  monotonic  dependence  between  y  and  R^  vor  y  and  R£). 
Therefore,  an  overrun  (or  non-overrun)  of  some  other  threshold  U  *  by  magnitude 
R^  (in  the  first  case)  or  R2  (in  the  second  case)  always  will  correspond  to  an 
overrun  (or  non-overrun)  of  threshold  UQ  by  magnitude  y. 


Expressions  (12.78)  and  (12.79)  coincide  with  the  corresponding  expressions 
obtained  in  the  preceding  section  for  packets  with  known  pulse  amplitudes.  Conse¬ 
quently,  optimum  receiver  structure  remains  the  same  (Figure  12.2),  with  the  only 
change  being  the  magnitude  of  threshold  bias  UQ  at  receiver  output.  Hence,  it 
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follows  that  a  change  in  pulse  amplitude  law  of  distribution  W(a)  leaves  the  optimum 
receiver  structure  and  mode  unchanged,  with  the  exception  of  the  threshold  bias 
UQ  magnitude. 

A  square-law  detector  [formula  (12.78)]  is  optimum  for  every  pulse,  given 
a  low  power  ratio  Qj/N0>  while  a  linear  detector  [formula  (12.79)]  is  optimum, 
given  a  high  ratio  Qj/N  . 

If  the  number  n  of  packet  pulses  is  small,  then  a  high  power  ratio  Q^/Nq 
is  required  for  highly-reliable  detection  (i.  e. ,  slight  error  probabilities 
P„ ,  and  Pnp ) ;  if  number  n  is  very  large,  the  reliable  detection  results  already 
when  Qj/Ng  1  since  power  ratio  nQ^/NQ  turns  out  to  be  sufficiently-high  for 
the  packet  as  a  whole.  Therefore,  in  practice,  the  first  case  (12.74)  occurs 
where  n ^^>1,  while  the  second  case  (12.75)  occurs  at  slight  n  values. 

The  law  of  distribution  of  sums  and  R2,  determined  from  formulas  (12.78) 
and  (12.79),  should  be  found  in  order  to  find  error  probabilities  P  a  Tand  Pn P  . 

Since  optimum  receiver  structure  in  this  case  is  the  same  as  for  reception 
of  packets  with  known  amplitude,  as  examined  in  the  preceding  section,  then,  the 
law  of  distribution  of  magnitudes  R^  and  R2  turns  out  to  be  the  same  when  there 
is  no  signal  and,  consequently,  the  fa  Is e-alarm  probability  P„  T  expression  remains 
unchanged.  However,  when  there  is  a  signal,  the  law  of  distribution  of  sums  R^ 
and  R2  is  different  than  is  the  case  for  packets  with  known  amplitude  and  will 
depend  an  the  law  of  packet  amplitude  distribution  W(a). 

Kaplan  analyzed  the  case  of  a  Rayleigh  law  of  distribution  W(a)  for  the  first 
time  [16].  Kaplan  only  examined  the  first  case,  when 


£1  and  Ol. 


His  results  using  our  designations  may  be  reduced  to  the  following  approximate 
form*  /199 


where 
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(12.80) 


In  these  ratios,  Qep  —  nQepl  =  n  t  is  packet  energy  mean  statistical  value. 
Fornulas  (12.80)  are  sufficiently  precise  for  the  following  assumptions: 


*7 


<  I;  n  > 


^np 0, 1 . 


(12.81) 


The  error  in  formulas  (12.80)  asymptotically  will  strive  towards  zero  due 
to  strengthening  of  inequalities  (12.81). 

It  follows  from  (12.80)  that,  given  sufficiently-large  values  of  n,  it  is 
possible  to  assume: 


N, 


2  tn-L -2,8 

P*  T _ 

Pup 


Yn. 


(12.82) 


In  light  of  a  decrease  in  the  number  n  of  pulses,  relative  formula  (12.80) 
error  rises  and  reaches  its  greatest  magnitude  where  n  s  1.  Here,  from  (12.80) 
we  obtain: 


Qep  l 


a+«i*  \ 


(12.83) 
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where 


a, =*]/  2  In  —  2,8 . 


In  actuality,  where  n  s  1,  relationship  (9.84)  occurs,  i.  e., 


(9.84) 


It  follows  from  comparison  of  relationships  (12.83)  and  (9.84)  that,  when 
n  =  1  (and  Pm  T<0,1  ),  the  formula  (12.80)  error  deee  not  exceed  2  tfi.  Therefore, 
it  is  permissible  to  adjust  formula  (12.80)  so  that  it  will  provide  a  precise 
result  not  only  when  n  ^s>  1»  but  also  when  n  a  1. 

Using  a  procedure  identical  to  that  used  in  the  preceding  section,  /200 
i.  e.,  instead  of  (12.80),  assuming  that 


where  A  —  adjusting  term  not  depending  on  n,  and  accepting 


2ln-L., 

Pat 


we  will  obtain: 


*-¥-■!— * 


(12.84) 


We  will  cell  this  formula  an  adjusted  Kaplan  formula  for  amicdbly-fluctueting 
packets.  It  provides  a  sufficiently -precise  result  if  n  «  1  and  n^l  (where 


^nP<0,l  and  P„  T<0,l  ).  For  intermediate  values  of  n,  formula  (12.84)  provides 
a  somewhat-inflated  ratio  Qep/iV0  value.  However,  the  resultant  error  here  is 
less  than  2  s. 

As  was  the  case  in  the  preceding  section,  we  will  introduce  the  concept  of 
loss  in  requisite  signal  energy  caused  by  packet  pulse  incoherence  (or  adding 
incoherence  if  the  pulses  are  coherent): 


(12.85) 


where  Q«p'  —  average  energy  required  for  an  analog  (monopulse)  signal,  i.  e., 
where  n  =  1. 


Then,  from  formula  (12.84),  we  will  obtain 


q,  =  1  + 


2(l'n-l) 


(12.86) 


It  follows  from  comparison  of  the  Figure  12.5  and  12.8  curves  that  the  dependence 
of  requisite  signal  energy  on  the  number  n  of  packet  pulses  in  the  case  of  amicably- 
fluctuating  packets  is  approximately  identical  to  that  in  the  case  of  packets 
with  a  known  amplitude.  The  loss  due  to  packet  pulse  incoherence  (or  due  /201 


to  pulse  adding  incoherence  if  the  pulses  are  coherent)  correspondingly  are 
proximately  identical  as  well. 

The  results  presented  above  applied  to  a  case  of  simple  binary  signal  detection 
The  approach  and  basic  results  presented  in  the  preceding  chapter  are  applicable 


Figure  12.7  Figure  12.8.  (a)  —  cb 

fully  for  simultaneous  detection  and  recognition  of  m  possible  packets  Uj^(t), 

.  .  .,  ^(t).  In  particular,  in  the  case  of  m  orthogonal  equiprobable  packets 
with  identical  average  energies  and  slight  error  probabilities  (  P. it  <0.1  and 

Pnp  <0,1  ),  requisite  average  packet  energy  QCp  is  determined  from  the  simple 
binary  detection  formula  (or  curves)  if  P*  r  is  replaced  by  P„  -,1m  in  it. 

Here,  formulas  (12.84)  and  (12.86)  take  the  following  form  (given  the  same 
assumptions) : 


+  2,n£7T+2,nm|:  (12.87) 


2(K«-I) 


j/"  81nj^-  +  2lnm 


(12.88) 


12.8  Detection  and  Recognition  of  Incoherent  Packets  With  Independently-Fluctuating 
Pulse  Amplitudes 


The  third  case  examined  in  §  12.3  occurs  for  independent  pocket  pulse  amplitude 


fluctuations  and  the  inverse  probability  of  the  packet  with  amplitudes  (a^t  . 
.  . ,  an)  is  determined  from  formula  (12.11),  to  wit 


P y  (fl|l  .... 


V.) 
N,  r 


Therefore,  inverse  probability  P^(C)  of  packet  presence  (with  any  pulse  /202 
amplitudes)  equals 


m  m 

P, (Q  - *5 ...  j P, (a, . aj dax ...  da,  - 


i-l  6 


(12.89 


But 


P(a,)**P(C)W  (a,), 

where  P(C)  —  a  priori  packet  presence  probability,  and 


(12.90 


a  «• 


(12.9] 


Further,  we  will  assume  that  amplitudes  a^  fluctuate  in  accordance  with  the 
Rayleigh  law.  Then,  comparing  (12.91)  with  (9.56)  and  (9.65),  we  obtain: 


P,  (Q-kPiQU^r  *«\ 

i-i  «?< 


(12.92 


where,  in  accordance  with  (9.63)  and  (9.64) 
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It  follows  from  (12.92)  that  the  inverse  probability  of  packet  absence  (i.  e., 
presence  of  a  packet  with  zero  pulse  amplitudes)  equals 

P,(0)=*P(0); 

therefore,  for  simple  binary  detection,  the  decision  as  to  signal  (packet)  presence 
must  be  made  when  this  inequality  is  satisfied 


P,(Q>tlP(0). 


(12.94) 


In  future,  we  will  accept  for  simplicity  that  all  packet  pulses  have  identical 
average  energies  and  durations,  i.  e., 

•••  “ftp »• 

—  =*; 

Qcp^nQcptl 


then,  considering  relationships  (12.92)  and  (12.93),  it  is  possible  to  write  signal 
presence  condition  (12.94)  in  the  following  form: 


V>U» 


where 


/-i 


+^)MT,7n?0  +  ,n(l  +S0]- 

Here,  as  was  the  case  earlier  [see  (12.9a)], 


(12.95) 
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(12.96) 


(12.97) 
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therefore 


+*'?)•  (12.98) 

It  follows  from  (12.96)  that,  in  the  case  being  examined,  i.  e.,  for  independent 
packet  pulse  fluctuations,  a  square-law  characteristic  is  the  optimum  response 
curve. 


Initially,  we  will  examine  a  case  where  there  is  no  signal  in  order  to  find 
detection  error  probability. 


It  follows  from  (12.9a)  that,  when  there  is  no  signal 


/,+t 

X,  =  5  tim  (t)  cos  mt  dt\ 

}i 

<,+t 

K,  =  5  Um  (t)  sin  mtdt. 
1 1 


(12.99) 


r 

It  was  demonstrated  in  §  $.2  that  random  magnitude  ^  j  ua(/)ue  (t)dl  has  a 
normal  law  of  distribution  with  zero  mean  value  and  dispersion  2Q/Nq,  where 

T 

Q  =  J  He*  (0  dt  }  hence,  it  follows  that  magnitudes  and  Y^,  determined  by 
expressions  (12.99),  have  a  normal  distribution  with  zero  mean  values  and  dispersions 

<*i*  t 


(12.100) 


Magnitudes  X^  and  mutually  are  statistically  independent  due  to  the 


orthogonality  of  functions  cos  U/ t  and  sin  LJ  t,  included  in  expression  (12.99) , 


and  a  normal  law  of  noise  distribution. 


Magnitudes  X,  and  with  varied  numbers  i  also  are  statistically  independent 
since  the  integration  intervals  corresponding  to  them  in  expressions  (12.99)  do 
not  overlap.  Thus,  magnitude  y,  determined  from  expression  (12.98),  is  the  /204 
sum  of  the  squares  2n  of  independent  normal ly-distributed  random  magnitudes  with 
zero  mean  values  and  identical  dispersions  1  . 

Therefore,  when  <T|*  =  1,  magnitude  y  has  law  of  distribution  yjn  (chi-square, 

with  2n  degrees  of  freedom).  This  law  of  distribution  has  been  tabularized  (see 
the  attachment,  for  example);  meanwhile,  magnitude  X2n(a)  ,  where  xL (a)  is  that 
magnitude,  the  probability  of  overrun  of  which  is  the  total  magnitude  y  (where 
0|*  =  1)  equals  Ct  ,  i.  e. ,  where  <T|*  =  1,  usually  is  presented  in  tables 


P[y>.tln(a.)  1  =a. 


(12.101) 


In  our  case,  dispersion  <r,*  does  not  equal  unity  and  is  determined  from  ex¬ 
pression  (12.100).  Therefore,  instead  of  (12.101),  we  should  assume: 


P  |y>o?X2«  (<*)]  =«• 


(12.102) 


It  follows  from  (12.95)  that  the  probability  of  satisfying  inequality  '12.95) 
when  there  is  no  signal  is  the  false-alarm  probability,  i.  e.,  when  there  is  no 
signal 


P(y>u,)~P*  t- 


(12.103) 


It  follows  from  comparison  of  (12.102)  and  (12.103)  that 


x*n  (P fl  t)* 


(12.104) 


or,  considering  (12.100) 


a 


+  (12.105) 


things?* 
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m  compute  Xan  (/VT)from  this  formula  for  given  magnitudes 
r*»  and  then,  using  the  xL(a)  tables  of  distribution,  to 
g  value  of  the  argument,  i.  e. ,  false-alarm  probability 


We  nee  will  examine  the  case  where  there  is  a  signal . 

When  there  is  a  signal,  the  probability  that  inequality  <12 . 
is  correct  signal  detection  probability  Pa  «  ,  i.  a.. 


is  satisfied 


P  i»>Vk=P**  -1-*V 


where  magnitude  y,  determined*  from  relationship  (12.91$),  is  no»  signal- plus- noise 


We  will  assume  for 
in  accordance  with  the  ( 
as  random  and  equally  pi 


law*  ,  The  phsm 


pulse  amplitude  is  distributed 
of  these  pulses  wee  accepted  above 


law*  Here,  amplitude  and  phase 
packet  pulses.  This  signifies  I 
ti  +  X  have  s  normal  lam  of  < 
that  the;  noise  hear*  .  Here,  sign* 


eluted  accordance  with  the  Ray  leigh 
satiatically  independent  for  various 
wouasignal  voltage  values  at  moments 
&  •*,  the  sews  law  of  distribution 
.«jpo,teee  m  normal  lam  of  distribution. 


'  Therefore,  when  those  is  a  signal,  the 
XA  and  YA  in  axpreaaion  (12.98)  and,  consagu 
ee  men,  rmaein  identical  to  that  when  only 
'  o,* '  is  determined  now*  hat  ftom  expression 
of  [l*  dkie  to  signal  action  where,  ih  accor 


jam  of  distribution  of  terms  /2Q5 
ently,  the  mafnitude  y  lew  of  distribution 
noise  ie  present.  However,  dispersion 
(12.168),  but  increases  by  a  factor 
dance  with  (9.78) 


there  ia  a  signal,  instead  of  (12.100),  this  expression  occurs 


'  --  *.  *.  A. 


,*=¥(.  +%i). 


(12.107) 


Since,  when  there  is  a  signal,  magnitude  y  has  the  identical  distribution 
as  is  the  case  when  there  is  no  signal,  then  formula  (12.102)  remains  valid. 


On  the  other  hand,  as  noted  above,  when  there  is  a  signal 

P{y>UJ  =  P„  0. 


When  this  expression  is  compared  with  (12.102),  we  have 


ahln(Pu  o)=l/«- 


Considering  (12.97)  and  (12.107),  finally  we  obtain: 


(12.108) 


It  is  possible  to  compute  from  this  formula  for  given 

M  P  (0),'P  (C),  QepfN0  and  n  and  then  use  the  X*«(*)  table  of  distribution  to  find 
the  corresponding  values  of  the  argument,  i.  e.,  magnitude  ^n0  . 


If  a  priori  probabilities  P(0)  and  P(C)  are  unknown,  then  magnitudes 
and  Pnn  must  be  given.  Here,  as  follows  from  (12.107)  and  (12.108),  the  following 
relationship  determines  requisite  signal  energy 


3«p 

N*  l  7.1n  (Pn  o) 


(12.109) 


**-«•*--  \  "jr  V  1  .•  **.  •**  «•"  *"  •*  -*\  *'w  **■«  **.  « 


*  *  *L\< 


Using  the  x?n  (a)  tables  of  distribution ,  it  is  not  difficult  using  forirula 
(12.109)  to  compute  signal-to-noise  QCV!N0  required  to  provide  given  probabilities 
P.t T  and  P„p  for  given  number  of  pulses  n. 

Distribution  Xj«(«)  ,  even  in  the  most  complete  tables,  is  given  only  for 

n  ^^50.  However,  for  greater  values  of  n,  it  is  possible  to  approximate  the 
magnitude  y  distribution  [formula  (12.96)]  using  a  normal  distribution  that  /206 
is  similar  to  what  was  done  in  §  12.6.  Then,  instead  of  (12.109),  we  obtain  re¬ 
lationship 


CD 

» («i  +  a*)  V  n,  where  n>  !• 


(12.110) 


This  expression  coincides  with  the  corresponding  expression  (12.48)  obtained 
in  §  12.6  for  a  signal  with  a  known  amplitude,  with  the  exception  that,  instead 
of  signal  energy  Q,  it  contains  the  mean  statistical  value  of  this  energy  Qcv 
As  usual,  magnitudes  Ct  ?  and  Ct  are  determined  from  formulas  (12.45). 


For  large  n  values,  Kaplan  obtained  a  slightly-more  precise  expression  [16], 
approximating  distribution  Xzn  ,  not  by  a  normal  distribution,  but  with  an  expansion 
into  a  power  series  with  respect  to  a  normally-distributed  magnitude.  Here,  instead 
of  (12.110),  the  following  more-precise  expression  is  obtained: 


-jf-  «  (ct,  -|-  a,)  |^|  fH - g — J. 


(12.111) 


where  and  &  ^  are  the  usual  values. 


Given  sufficiently -large  values  of  n,  expression  (12.111)  coincides  with 
(12.110),  as  consequently  would  be  expected. 

Formula  (12.111)  is  sufficiently  precise  already  when 


n  >  15  -  20. 


The  curves  depicted  in  Figure  12.9  are  plotted  from  formulas  (12.109)  and 

(12.111). 
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Figure  12.9 

We  now  will  find  loss  f\  determined  from  relationship  (12.85) ,  in  which 
Qr p'  is  value  Qcp  occurring  when  n  =  1  and  determined  from  formula  (9.83). 


It  follows  from  formulas  (12.109)  and  (9.83)  that 


*u(p«r)  _,T 

xl «(P„,)  j 


/  In  -p — 

Uk. 


(12.112) 


Where  n  ^^^15—  20,  in  accordance  with  formulas  (9.83)  and  (12.111),  we 


obtain 


(«i  +  *i)  y  n  + 


y-" 


(12.113) 


where  Pjit^O.I  and  Pnp^O.l  ,  it  is  possible,  in  accordance  with  (12.45),  /207 


to  assume 


«,«)/  2 In -^--2.8; 

a,«  V  2ln-p! - 2,8. 


(12.45) 


Finally,  if  n^>l,  /**»£!  and  PBp  *  ,  from  (12.113)  and  (12.45), 

we  obtain 


P«p  {^2ln  -jr—+  j/zin-pL.]  y  n 


(12.114) 


It  is  possible  to  plot  the  curves  depicted  in  Figure  12.10  from  formulas 
(12.109)— (12.114)  or  from  the  Figure  12.9  curves. 
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Figure  12.10.  (a)  —  cb. 
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It  is  possible  to  draw  the  following  basic  conclusions  from  analysis  of 
the  formulas  presented  above  and  the  Figure  12.9  and  12.10  curves: 

1*  In  the  case  of  independently-fluctuating  pulses,  dependence  of  requisite 
signal  energy  on  the  number  of  pulses  n  has  (where  P no 5s 0.5  )  a  minimum,  given 
some  optimum  number  of  pulses  float  • 

2.  Magnitude  floor  will  depend  little  on  permissible  false-alarm  probability 
p„r  ,  but  will  depend  greatly  on  permissible  correct  detection  probability 

Pno  ,  increasing  as  probability  Pn n  rises.  Thus,  for  example,  where 
Poo  s  0.9,  the  result  is  «  5;  if  Pm«  *  0.99,  then  «0Bt  *  10. 

3.  The  gain  occurring  when  n  =  flour  >  in  comparison  with  n  =  1,  rises  with 

a  decrease  in  error  probabilities  PBp  and  P„t  and  will  depend  especially  radically 
on  probability  Pnp  .  Thus,  for  example,  if  where  Pnp  =  0.1  and  P„s  10“*,  /208 
maximum  gain  equals  4.2  cfi,  with  this  increasing  already  to  15  dB  where 
Pa  p  =  0.01  and  P„T  =  10"6. 

4.  It  follows  from  comparison  of  relationships  (12.48)  and  (12.110)  that, 
when  n  ^  1,  the  average  energy  required  for  detection  of  a  packet  with  indepen¬ 
dently-fluctuating  amplitudes  equals  the  energy  required  for  identically-reliable 
detection  of  a  packet  with  known  amplitudes. 

If  n  s  1,  then,  in  the  case  of  a  fluctuating  amplitude,  considerably-greater 
signal  energy  is  required  for  reliable  detection  (  P«T<0,I  and  ^ap<0tl  ) 

than  is  the  case  for  known  amplitude. 

It  follows  from  the  aforementioned  points  that,  given  high  requirements  levied 
for  fluctuating  signal  detection  reliablity  (given  slight  PBp  and  P„T  ),  it 
is  possible  to  obtain  a  significant  gain  in  requisite  signal  energy  if  an  analog 
(monopulse)  signal  is  divided  into  n  sufficiently-separated  over  time  (or  frequency 
occupation)  pulses  so  that  the  fluctuations  of  adjacent  pulses  may  be  considered 
statistically  independent.  Here,  it  is  desirable  to  select  n  &  float  • 

The  possibility  of  obtaining  a  gain  in  requisite  energy  occurs  here  because 
the  probability  that  all  packet  pulses  n  will  "fade"  (decrease  in  amplitude)  at 


the  same  time  is  significantly  lower  than  the  probability  of  such  complete  fading 
of  a  monopulse  signal. 


The  above  analysis  applied  to  a  case  of  simple  binary  detection.  The  approach 
and  basic  results  presented  in  the  preceding  chapter  are  applicable  for  simultaneous 
detection  and  recognition  of  m  possible  packets  (or  in  the  case  of  an  m-channel 
receiving  system). 


In  particular,  in  the  case  of  m  orthogonal  equiprobable  packets  with  identical 
mean  values  and  given  slight  error  probabilities  (  ^  r  <  0, 1  and  P np^O.l  ), 
requisite  packet  average  energy  Qep  is  determined  from  simple  binary  detection 
formulas  (or  curves)  if  Pn  t  is  replaced  by  P»  T !m  . 


Thus,  for  example,  formulas  (12.109)  and  (12.111)  take  the  form  /209 


j  1  P«r\ 

Ss.  -  j. 

N, 

(Pji  o) 

(12.115) 


Where  n  ^>15  —  20 


+  (12.116) 

where  (for  Pj,t<0,1  and  ) 
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12.9  Special  Features  of  Signal  Detection  and  Recognition  in  Presence  of 
Nonwhite  Noise 

The  entire  analysis  presented  above  for  signal  detection  and  recognition 
involved  noise  in  the  form  of  normal  white  noise.  S.  Ye.  Fal'kovich  [5]  for  the 


first  time  obtained  corresponding  results  for  normal  nonwhite  noise  using  the 
approach  presented  in  Chapter  3  for  reducing  nonwhite  noise  to  white  noise. 

Let  ^  (ja>)  and  Sm  (w)  be  signal  uc(t)  complex  spectrum  and  noise  um  (t)  power 
spectrum,  respectively.  Then,  the  optimum  receiver  will  comprise  (Figure  3.1) 
correcting  linear  two-port  ("whitening1*  filter)  with  transfer  constant  Kt  (/<->) 
and  receiver  /7'  optimum  for  white  noise  «n'  (0  with  spectral  energy  distribution 
(unilateral)  Nq  and  transformed  signal  uc*(t)  with  spectrum  S'  (jw)  .*  Here 


I  *,(/«)  I* 


Sm  (»)  ’ 


(12.117) 


where  b  —  some  constant: 


S'(jta)  (/»)  S  (/co). 


(12.118) 


Here 


2 1  /C,  (i«)  |*  (w)  =  2b. 


(12.119) 


uA(t), 


We  will  examine  the  problem  of  the  detection  with  recognition  of  m  signals 

.  .,  u.(t),  •  •  •»  um(t)  having  spsctrum 5,  (/»),  ....  S*  (/«) . on 

a  background  of  nonwhite  noise  «*(/)  with  power  spectrum  Sm  («) .  Then,  due  to 
what  has  been  presented  above,  the  frequency  characteristic  of  the  "whitening" 
filter  is  determined  from  relationship  (12.117)  and  the  task  will  boil  down  to 
finding  receiver  IV  ,  which  optimally  will  solve  the  problem  of  detection  with 
recognition  of  m  transformed  signals  u^*(t),  .  .  .,  i^*(t),  . 


.,  u^Ht)  an  the 


background  of  white  noise  ««'  (/)  with  spectral  energy  distribution  Nq. 


Here,  in  accordance  with  (12.117)  and  (12.118),  the  spectrum  of  the  k-th  /210 
transformed  signal  (k  =  1,  .  .  .,  m)  equals 


S*{/«) —  .  *  —  5*  (/•»). 


(12.120) 


*See  not a  on  page  54. 
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A  priori  probabilities  of  the  appearance  of  transformed  signals 
u^'Ct)  (k  =  1,  .  .  m)  coincide  with  the  a  priori  probabilities  of  corresponding 
initial  signals  u^t)  (k  :  1,  .  .  . ,  m).  Therefore,  a  priori  (inverse)  probabilities 
Py(uo)»  Py^ul^»  •  •  •»  Py(um)  may  be  found  and  compared  by  using  the  optimum  system, 
the  circuitry  of  which  is  depicted  in  Figure  11.1,  described  in  the  preceding 
section.  Here,  y(t)  should  be  understood  to  mean  signal-plus-noise  at  output 
of  the  '’whitening"  filter  with  transfer  function  (12.117).  Since  noise  «m' (0 
included  in  signal-plus-noise  y(t)  turns  out  here  to  be  white,  the  approaches 
described  in  Chapter  11  relative  to  white  noise  may  be  used  to  find  the  structure 
of  receivers  /7k  (k  s  1,  .  .  . ,  m). 

In  particular,  receiver  /7k  input  stages  may  be  realized  in  the  form  of  linear 
filters  matched  with  transformed  signals  i^’(t).  Here,  the  transfer  constant 
of  the  k-th  matched  filter  equals 

Kk  (ju)  =>  aS*  (}m) 

where  a  —  some  constant; 

Sk  (/«)*  S»  (—/<#). 

For  simplicity,  we  will  limit  ourselves  to  examination  of  the  following  case 
for  computation  of  error  probabilities  and  determination  of  the  requisite  signal- 
to-noise  ratio. 

We  will  assume  that  input  signals  u^(t),  .  .  .,  um(t)  are  equally  probable, 
orthogonal,  and  have  identical  energy  Q.  In  addition,  we  will  assume  spectra 
Sk  (/«)  (k  s  1,  .  .  .,  m)  and  5a(«)  are  such  that  transformed  signals  u^'(t), 

.  .  .,  um'(t)  retain  the  property  of  orthogonality  and  equality  of  energies  (this 
occurs,  for  instance,  if  amplitude  spectra  | <S *  (/<■>) I  (k  =  1,  .  .  .,  m)  of  all 
si<?ial8  are  identical,  while  signal  orthogonality  insures  a  sufficiently-great 
separation  of  their  activities  over  time). 

When  these  conditions  are  met,  the  task  will  boil  down  to  detection  with 
recognition  of  m  equiprobable  orthogonal  signals  u^'(t),  .  .  .,  um'(t)  with  identical 
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where 


J  I S  (finf)  \*df 
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J  \9sl 1} 

0  [om(U)\ 


(12.127) 


Formula  (12.126)  provides  the  link  between  power  ratio  Q/Gm  (/„)  ,  required 
in  the  case  of  nonwhite  noise  with  spectrum  Gm  (f)  ,  and  corresponding  ratio  Q'/Nq 
for  white  noise,  which  was  determined  in  preceding  chapters  (in  the  case  of  fluc¬ 
tuating  signals,  Q  and  Q*  are  replaced  by  Qep  and  Qep  )  • 

Consequently  factor  M  demonstrates  by  what  factor  requisite  signal-to-noise 
power  ratio  at  system  input  changes  when  nonwhite  noise  replaces  white  noise. 


Thus,  for  example,  in  a  case  of  nonwhite  noise,  formula  (12.87)  takes  the 
following  form: 


M/.>  7^7  +  ,nw  +  (/2,nr“+2lnm ]• 


(12.128) 


where  M  is  determined  from  formula  (12.127). 

As  our  illustration,  we  will  examine  the  same  example  as  was  used  in  Chapter 
3,  thereby  assuming  that  signal  and  noise  spectra  at  system  input  have  the  form 


|  S  [j2nf)  |*  =  Sjj  c" ; 
<?«,(/) -Ga,(/t)e-(,-MV*< 


(12.129) 


Substitution  of  these  relationships  into  formulas  (12.126)  and  (12.127)  provides 


i 


Q  _Q‘ 
dm  if*) 


(12.130) 


It  was  explained  in  Chapter  3  that  the  optimum  system  for  type  (12.129)  signal 
and  noise  spectra  turns  out  to  approximate  the  physically -realizable  system  only 
when  condition  (3.12)  is  met. 

Therefore,  formula  (12.130)  makes  sense  also  only  when  this  condition  /213 
is  met 


n ,  ^  i 

\'2 


(12.131) 


It  follows  from  formula  (12.130)  that  /7m //7e  >  2  •— 3  results  when  ) 

«  Q'/N,  . 

This  denotes  that,  if  the  width  of  the  noise  power  spectrum  is  even  greater 
by  a  factor  of  2—3  than  that  of  the  signal  spectrum,  then  such  noise  essentially 
acts  just  like  white  noise  during  detection  and  recognition  of  orthogonal  signals. 
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CHAPTER  THIRTEEN 


/213 


MEASUREMENT  OF  SIGNAL  ANALOG  PARAMETERS 
13.1  Measurement  of  Random  Initial  Phase  Signal  Amplitude 
Let 

lf(0 (0 +«■(<).  (13.1) 


where 


"«.*(/)=  a  cos  («>/  +  q>)  (13.2) 

is  a  signal,  the  amplitude  a  of  which  requires  measurement.  Here,  frequency  U 
is  assumed  known,  while  initial  phase  is  random  and  equally  probable,  i.  e., 
has  distribution  (9.3). 

Amplitude  a  has  known  a  priori  distribution  P(a).  Amplitude  a  and  phase 
c|j  are  assumed  to  be  analog  random  magnitudes,  i.  e.,  they  are  constant  during 
observation  cycle  (0,  T),  while  they  change  from  one  cycle  to  another  with  indicated 
probability  densities  P(a)  and  P(  c£>  ),  respectively. 

The  amplitude  measurement  problem  will  comprise,  evidently,  solution,  based 
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on  analysis  of  realization  y(t),  of  which  value  amplitude  a  will  have  during  a 
given  observation  cycle  (0,  T).  In  accordance  with  the  inverse  probability  approach 
presented  in  Chapter  8,  this  requires  computation  of  distribution  Py(a). 

for  the  case  being  examined,  distribution  Py(a)  was  computed  in  §  9.1  and 
is  determined  from  formula  (9.15).  The  problem  of  the  structure  of  the  optimum 
receiver  computing  distribution  (9.15)  or  (9.15a)  was  examined  in  the  same  section. 

Especially-simple  and  clear  results  are  obtained  if  one  assumes  that  a  priori 
distribution  P(a)  is  uniform,  while  the  signal-to-noise  ratio  is  sufficiently 
high.  Therefore,  we  will  examine  this  case  in  more  detail. 

For  an  equidimensional  a  priori  amplitude  distribution,  formula  (9.15)  /214 
takes  the  form 


(13.3) 


where  kj  —  constant  normalizing  factor. 


Assuming 


dP„[a)  _n 

— r - v, 

da 


we  will  find  the  most-probable  value  a„n  of  amplitude  a  corresponding  to  the 
curve  Py(a)  curve: 


/  2a„HM  ) 
2M  1 1  Nt  I 
T  (  2 a#  n  M  \ 


(13.4) 


where  IQ(z)  and  l^(z)  —  modified  Bessel  functions  (imaginary  argument  jz). 


Equation  (13.4)  is  transcendental;  however,  given  a  high  signal-to-noise 

ratio,  it  is  possible  to  assume,  considering  the  properties  of  Bessel  functions 

I  (z)  and  I.(z),  that 
o  i 

m3ezL, 

-^T)~ 


and 


(13.5) 


In  addition,  given  a  high  signal-to-noise  ratio,  it  is  possible  to  assume 

(13.6) 
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since,  where  z  ^  3 


uw- 


y  2m 


(13.7) 


Considering  relationships  (13.5)  and  (13.6)  and  normality  condition 


J  P,(a)da~ I, 


it  is  possible  to  reduce  expression  (13.3)  to  the  following  form,  valid  for  a 
high  signal-to-noise  ratio: 


t~w,  (€_  •»«)*. 


(13.8) 


•  j 

* 
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If  amplitude  a  is  modulated  by  message  x  in  accordance  with  the  law  /215 


a*i/,(I  -j-  mx). 


(13.9) 


where  magnitude  uQ  is 
we  obtain 


precisely  known,  then,  replacing  variable  a  with  x  in  (13.8), 


(13.10) 


where  -v»«  —  most-probable  message  x  value  and 


fc _ e*Qi.  a  u»* T 


(13.11) 


Comparing  formulas  (13.10)  and  (13.11)  with  corresponding  formulas  (6.12) 
and  (6.22)  obtained  in  Part  II  of  this  book  for  a  precisely -known  signal,  it  is 
not  difficult  to  become  convinced  that  they  are  identical.  Consequently,  given 
a  high  aignal-to-noiae  ratio,  resultant  distribution  Py(x)  for  a  random-phase 
Mf  signal  is  identical  to  that  for  a  precisely-known  AM  signal. 

The  optimum  receiver  will  reproduce  message  x  exclusively  on  the  basis  of 
analysis  of  distribution  Py(x).  Therefore,  noise  immunity  during  reception  of 
a  random* phase  AM  signal  is  identical  to  that  during  recaption  of  a  precisely-known 
signal  if  the  signal-to-noiee  ratio  is  high.  Hers,  the  error  in  message  x  repro¬ 
duction  is  described  by  the  seme  formulas  (6.17)  and  (6.22). 

Consequently,  given  a  high  signal-to-noiae  ratio,  the  random  nature  of  signal 
rf  occupation  phase  Cp  will  not  load  to  message  reproduction  deterioration. 

If  the  aignel-to-noiae  ratio  is  not  high,  than  the  resultant  analysis  is 
conoiiiorahly  more  unwieldly.  It  deMonotates  that  the  lower  this  ratio,  the  greater 
the  deterioration  in  noioe  immunity  phase  irregularity  will  lead  to. 

It  follows  from  formula  (23.5)  that,  given  a  high  eignal-to-noiae  ratio, 


output  voltage  /  of  an  optimum  receiver  operating  on  the  maximum  inverse  probability 
principle,  i.  e. ,  on  the  principle 

Y=A„a. 

is  determined  by  very-simple  relationship 

V  =  const  M,  (13.12) 

where  M  —  envelope  of  the  optimum  linear  filter  output  voltage  (at  moment  T). 
Consequently,  in  this  case,  the  optimum  receiver  will  comprise  an  optimum  linear 
filter  and  a  linear  amplitude  detector  (Figure  9.2). 

It  was  demonstrated  in  §  2.4  that,  if  only  one  signal  pulse  arrives  during 
each  observation  cycle  (0,  T),  then  the  optimum  linear  filter  may  be  replaced 
almost  without  loss  by  a  conventional  band-pass  amplifier  (for  instance,  by  a 
conventional  if  amplifier)  with  optimum  bandwidth.  In  this  case,  the  optimum 
receiver  boils  down  to  a  standard  AN  signal  receiver  comprising  a  band-pass  /216 
amplifier  and  linear  amplitude  detector. 

As  already  noted  above,  the  mean  square  of  the  error  in  normalized  message 
x  reproduction,  given  a  high  signal- to-noise  ratio,  as  usubI  is  determined  from 
formulas  (6.17)  and  (6.22),  from  which  it  folllows  that 


where 


(13.13) 


(13.14) 


We  now  wilx  find  the  mean  square  of  the  amplitude  measurement  error,  i.  e., 

2 

magnitude  (Aa  ),  where 

Aa=a„H  —  a  (13.15) 

(here,  just  as  before,  it  is  assumed  that  the  optimum  receiver  operates  on  the 
maximum  inverse  probability  density  principle). 

It  follows  from  relationships  (13.9),  (13.14),  and  (13.15)  that 

(Aa)*  =  a0*  m*  6*  (13.16) 

or,  considering  formula  (13.13) 


(13.16a) 


i.  e.,  the  mean  square  of  the  amplitude  measurement  error  will  depend  only  on 
relative  noise  energy  per  unity  of  band,  Nq,  and  observation  time  T. 

13.2  Measurement  of  Random-Phase  Pulse  Signal  Moment  t  of  Arrival 


Let 

y  (/)  =  a  (/  —  t)  cos  (tat  +  q>)  +  um  (/).  (13.17) 

where  a(t  —  X)  —  envelope  of  a  precisely-known  signal,  with  the  exception  of 
time  shift  X  ,  subject  to  measurement. 


Figure  13.1 


Since  this  is  assumed  to  be  a  pulse  signal  (Figure  13.1) ,  then 

)  (13.18) 

if  t  <x  then  J 


where  rn  —  signal  pulse  known  duration. 

Time  of  observation  T  encompasses  the  interval  of  all  possible  signal  /217 
pulse  positions  corresponding  to  all  possible  measured  shift  X  values. 

Frequency  M  is  assumed  to  be  known,  while  initial  phase  —  is  a  parasitic 
random  parameter  distributed  in  accordance  with  law  (9.3).  Parameters  X  and 

are  assumed  to  be  analog  independent  random  magnitudes,  i.  e.,  they  are  constant 
during  observation  cycle  (0,  T). 

Distribution  P  (  T  )  analysis  must  form  the  basis  for  measurement  of  moment 
T  .  It  follows  from  (8.10)  and  (13.17)  that 


P*.*(y) 


1 


(V  *&)" 


exp 


—4*  (  ly <0 — «(/  —  t)cos(orf  +  v)|*<//) 


Considering  (8.4),  (8.6),  and  (8.8),  we  have 


in  in 

P,  (t)  «  j  Pt  (t,  V)  df  -  J  kP  (T)  P  («r)  Pt.  t  (!/)  dip 

0 


in  --jj- 

je  *°  .e'K'Vdf, 

where  P(  X  )  —  a  priori  parameter  T  distribution, 


n  (t,  q>)  -  J  y  (0  a  (t—x)  cos  (w /  +  <p)  dt. 


(13.19) 


(13.20) 


while  k,  —  normalizing  constant. 


Since  interval  (0,  T)  includes  all  possible  signal  pulse  positions,  then 


r 

J  a •  (/— t)  cos*  (<■>/  +  ip )dt  —  Q, 
o 

where  Q  will  not  depend  on  X  •  Therefore,  from  (13.19)  we  have 

7m 

P„  (T)=*tP(x)  j  en  (t.  t)  dfJI< 
o 

where  —  constant  determined  from  a  normality  condition. 

It  follows  from  (13.18)  and  (13.20)  that 


J  y(i)a(t  —  t)cos(o>H V)dt.  ( 

It  is  not  difficult  to  reduce  this  expression  to  the  following  form:  / 

»|(*.<P)  AIi(t)cos(0+<P)»  ( 

where 


M,(t)  =  VX\+Y\\ 

’  +  *!!  • 

Xi-  j  y(t)  a(<— t)  cos  at  dl\ 

X 

f  +  fB 

Yt  **  j  y(I)a(t  — i)$\nutdt. 


Substituting  (13.24)  into  (13.22),  we  obtain 


where  k_  —  constant  determined  from  a  normality  condition. 


It  follows  from  (13.23)  and  (13.24)  that  M^(  T  )  is  the  envelope  with  respect 


of  oscillation 


~  'I  (*.  t) 


i.  e.,  of  oscillation 


T  +  t. 


|{t,«p)  ^=  y  (/)  a  (/  —  t)  cos  (o»/  4-  <p)  dt , 

T 

considered  as  a  function  of  phase  ^  . 


(13.27) 


We  will  show  that  function  £  M  may  be  obtained  by  passing  oscillation 
y(t)  through  an  optimum  linear  filter  matched  with  the  signal. 


When  oscillation  y(t)  is  supplied  to  the  input  of  a  linear  system,  its  output 
voltage  equals 


'W  (')«*«  f  y(z)r\(t—z)dz, 

—  m 


where  —  some  constant  magnitude,  while  (t)  —  linear  system  pulse  charac¬ 
teristic. 

In  the  case  of  an  optimum  filter  matched  with  signal  u (t), 

w 

Since,  in  the  case  being  examined,  the  signal  disappears  when 
t  a  T  +  ,  then  we  select 


ten 


and 


t 

«■«« (O--* 4  J  ?W«e(» +xm—t+z)dz. 

—  m 


Since  there  is  no  signal  when  t  T  (Figure  13.1),  then  u  (t)  s  0  /219 
where  t  7  and  ue  (t  +  t,  —  /  -f  ?)  0  where  T-Mn  —  /  +  ?<r  ,  i.  e. ,  where 

z  <;  t  _  T>  ;  therefore 


i 

“.u«(0=-A|  S  y  (2)  Me(t  +  T„ -/+?)*. 


At  the  moment  /  =  t  +  t„  ,  the  result  is 


t+»B 

"•««(*  + »«)=*«  J  y(t)ut(t)dt 


t  +  T_ 


*«  5  y{t)a{(—  T)  cm  (w/f  «!<)<//. 


This  expression  coincides  (precise  to  a  constant  factor)  with  (13.27).  Con¬ 
sequently,  envelope  Um9t4t(l)  of  optimum  filter  output  voltage  at  moment 
t  =*  T  +  T«  also  coincides  (precise  to  a  constant  factor)  with  the  envelope  with 
respect  to  Cp  of  function  £  (  T  t  (j>  ),  i.  e.,  with  magnitude  M^(  X  )(here 
and  elsewhere  it  ia  assumed  that  the  filter  output  voltage  envelope  exists,  i.  e., 
the  pulse  signal  will  comprise  at  least  several  rf  occupation  periods).  Thus, 
desired  magnitude  M^(  T  )  included  in  expression  (13.26)  actually  is  proportional 
to  the  optimum  filter  output  voltage  envelope  at  moment  f—x+r,,  »  i»  o.,  at 
the  moment  the  signal  pulse  ceases  at  filter  input  (Figure  13.1). 

But,  it  was  demonstrated  in  Chapter  2  that,  by  the  moment  the  signal  at  optimum 
filter  input  ceases,  the  voltage  at  its  output  attains  the  maximum.  Consequently, 
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magnitude  M^(  X  )  i8  proportional  to  the  peak  value  of  the  pulse  voltage  at  optimum 
filter  00  output. 

Thus,  the  basic  operation  required  to  determine  distribution  P  (  T  )  from 
formula  (13.26)  and  consisting  of  computation  of  function  M^(  X  )  may  be  accomplished 
by  using  a  linear  filter  matched  with  the  signal  and  a  linear  detector  separating 
this  filter's  output  voltage  envelope. 

Then,  if  detector  output  voltage  equals  IA.  (/)  ,  then 

Mi(t)  (T  —  tw).  (13.28) 

where  —  constant  not  depending  on  X  • 

Subsequent  operations  required  for  P^(  X  )  computation  from  formula  (13.26) 
present  no  real  difficulties  since  magnitude  Nq  and  function  P(  %  )  are  assumed 
known. 

Optimum  receiver  structure  especially  is  simplfied  if  one  assumes  a  priori 
distribution  P(  X  )  ia  uniform  and,  if  the  maximum  inverse  probability  density 
approach  is  used,  i.  e.,  if  one  considers  true  that  value  X  at  which  magnitude 
Py(  X  )  is  maximum.  In  this  case,  from  (13.26)  we  have 

(13.29) 

where  kj  —  constant  determined  from  a  normality  condition. 

Since  I  (z)  is  a  monotonic  function  of  z,  then,  when  X  changes,  maximum  /220 
o 

inverse  probability  density  P  (  T  )  coincides  with  the  function  M^(  X  )  maximum. 
Therefore,  when  the  maximum  inverse  probability  density  criterion  is  used,  that 
value  Tva  of  parameter  X  in  which  function  M^(  X  )  is  maximum  should  be  used 
as  the  true  value.  Here,  the  optimum  receiver  boils  down  to  optimum  linear  filter 
00  matched  with  the  signal  and  inertia-free  envelope  detector  R  (Figure  13.2). 
Since,  in  this  case,  the  requirement  is  to  determine  only  the  moment  of  the  envelope 


.  (*-)  U- 


Figure  13.2  (a)  —  OF  [optimum  filter]; 
(b)  —  0  [detector]. 


Figure  13.3 


maximum,  then  response  curve  shape  plays  no  role  —  it  is  only  important  that 
this  shape  be  monotonic. 

Actually,  at  some  moment  t  =  /RP  ,  let  detector  output  voltage  t/.  (0 

have  the  greatest  value  (Figure  13.3).  At  that  moment  /NP  ,  the  value  of  envelope 
voltage  t/m*Us(0  at  optimum  filter  output  also  will  be  maximum.  Since 


M,  (t)  —  const  Um  IUX  (T  +  T.,). 


(13.30) 


then  function  H^(  t  )  will  be  maximum  where  t  —  /„p  —  x. ;  consequently,  the  most- 
probable  value  rfm  of  the  desired  parameter  is  determined  from  the  relationship 


T»* 


(13.31) 


(Here,  just  as  befors,  index  y  denotes  that  the  magnitude  of  the  most-probable 
parameter  value  also  will  differ  for  different  y(t)  realizations). 

Since  pulse  duration  t,  is  assumed  known,  then  it  suffices  for  determination 
of  magnitude  **»  to  determine  moment  /np  ,  when  receiver  output  voltage 
(/.(0  attains  the  greatest  value  (Figure  13.3). 


Thus,  for  the  given  assumptions,  measurement  of  time  %  of  pulse  arrival 
at  receiver  input  boils  down  to  determination  of  moment  /Hp  when  receiver  output 
voltage  (/.  ( t )  reaches  the  greatest  value. 


As  already  noted  above,  the  maximum  inverse  probability  density  criterion 
is  not  the  best,  in  particular  does  not  insure  the  least  mean  square  measurement 
error.  However,  it  will  be  demonstrated  later  (Chapter  19)  that,  given  a  high 
signal-to-noise  ratio,  i.  e. ,  given  a  requirement  for  high  measurement  precision, 
this  criterion  is  the  best.  Therefore,  we  will  examine  the  case  of  a  high  /221 
signal-to-noise  ratio  in  more  detail.  Here,  in  accordance  with  expression  (13.7) 
it  is  possible  to  assume 


PA  t)  =A, 


e*«i 

l/ 

Nt 


Since,  given  a  high  signal-to-noise  ratio,  the  indicator  of  the  exponential 
factor  power  is  much  greater  than  unity,  this  factor,  given  X  changes,  changes 
much  faster  than  the  expression  in  the  denominator.  Therefore,  it  is  possible 
to  assume 


P,(r)  a  At,  e*"i  («>/*», 


(13.3 


where  constant  factor  may  be  determined  from  a  normality  condition. 

In  accordance  with  expression  (13.27),  function  M^(  T  )  is  the  envelope 
with  respect  to  function 


5  (t,  <p)  =  J  y  (/)  a  (/  —  t)  cos  (mi  -I-  «r) ' 


(13.2 


y  (0  -  a  (/ — T,)  cos  (wf  +  q>,)  +  (/), 


(13.3 


where  T Q  and  —  parameter  T  and  (£)  true  values  for  a  signal  present 

at  input  during  a  given  observation  cycle. 


We  will  assume  that 


««„>!,  (13.34] 

i.  e. ,  that  the  signal  pulse  will  comprise  at  least  several  rf  occupation  periods. 
Then,  when  expression  (13.33)  is  substituted  into  (13.27),  it  is  possible  without 
disrupting  the  generality  of  the  result  to  assume  that  (p1  q  s  0. 

In  addition,  when  using  formulas  (13.27)  and  (13.33)  for  a  relatively-high 
signs  1-to-noise  ratio  in  computation  of  £  (  T  ,  ),  it  is  possible  to  ignore 

the  second  term  in  expression  (13.33).  Then,  expression  (13.27)  takes  the  form 


l (t,  9)  -  $  " a (/ - f *) 0 « - 1) cos cos (<*/  +  <T)  dt .  (13.35: 

f 

Considering  inequality  (13.34),  it  is  not  difficult  to  demonstrate  that 


»+tB 

S 


T 


a(t—x9)o(t — x)dt 


] 


COS  <p 


and,  consequently, 


(13.36] 


Since,  given  a  high  signa 1-to-noise  ratio,  the  error  in  measurement  of  /222 
magnitude  X  is  slight,  curve  P  (  X  )  has  significant  values  only  in  that  region 
where  X  approximates  Tq,  i.  e.,  given  slight  values  of  difference 

Ar-t  —  t,;  (13,37] 

therefore,  it  is  possible  to  assume 


a (t - x)  «a(/ -x9)  +a'  {( -  t,)  At+  -i-  a* (/ -t#) (At)*. 


(13.38 


V  V  .-.•’A  V  *.'  *.  a  '» 


where 


Substituting  these  expressions  into  (13.36),  we  obtain 


MT)  —  Q  +  ~  J  a(i—xt)  a'  (t— r#)  dt  -f- 


+  $  a(/-t0) a" (t—x9)dt, 


where 


Q=±  j 


is  signal  energy  not  depending  on  X  . 


f  a  (/ — 1#)  a'  (/ — 10)  dt  =  —  J  a(/— t,) — jj 


"  «'«-■» ,)  _n 
2 


j  "  a  (/  - 1„)  a"  (/  — 10)  tit  -t*  $ 


d’  a  (I  — ... 


performing  integration  by  parts,  we  obtain 


a(t  — r,)o*(/— rjd/  *  'T'  .(.-*»  'f  [Pp^-O-'jV  <<-i.il** 


Considering  these  results,  expression  (13.40)  takes  the  fonn 


j  [a'  (t—x)]*dt\ 


therefore,  formula  (13.32)  may  be  written  in  the  following  manner: 


(13. 4] 


where 


.-^.rrju=sr* 

"  L  **  Jt, 


(13.4J 


It  f ollows  from  the  normality  condition  that 


M 

JPr(T)rfT  =  |, 


X*  ©•  i 


*,  je-® <*"’•»* dx-\- 


(13.4: 


\  w  v *r  v  - .* oo *.*  *•  <  t-  •%  .% 


m 


Considering  that,  given  a  high  signs 1-to-noise  ratio,  factor  B  is  large, 
it  is  possible  in  expression  (13.43)  to  replace  the  finite  limits  with  infinite 
limits,  i.  e. ,  to  assume 

m 

kj  J 


the  result  here  is 


Pt  (t)  j/J  e-B 


(13.44) 


It  follows  from  this  formula  that  distribution  P^(  X  )  is  a  gaussian  curve 
having  a  maximum  when  X  s  X  Q*  i*  e*»  ^or  the  true  value  of  the  measured  parameter. 
However,  instead  of  (13.44),  it  is  more  correct  to  assume 


(13.45) 


where  —  most-probable  value  (i.  e.,  value  corresponding  to  the  maximum 

of  curve  P  (  X  )]»  which  approximates  X  Q,  but  it  is  not  mandatory  that  it 
coincide  precisely  with  it. 

Actually,  it  was  pointed  out  above  that  the  optimum  receiver  supplies  /224 
voltage  proportional  to  Py(  X  )»  therefore,  in  principle,  it  is  possible  absolutely 
precisely  to  determine  value  xv„  ,  which  corresponds  to  the  curve  Py(  X  )  maximum. 
If  Tim,  always  coincides  with  T  ,  then  this  would  signify  that  the  measurement 
error  always  equals  zero.  But,  this  contradicts  expression  (13.44)  since  it  is 
evident  from  it  that  distribution  P^(  X  )  differs  from  the  delta- function  and, 
consequently,  there  always  is  a  finite  probability  of  some  measurement  error  values. 
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This  contradictory  result  was  obtained  because,  when  computing  M^(  X  ), 
we  assumed  in  expression  (13.33)  a  noise  voltage  not  only  slight  compared  to  the 
signal,  but  also  identical  with  zero.  This  led  to  the  fact  that,  instead  of 
approximate  value  n  of  the  measured  parameter,  its  true  value  TQ  was  included 
in  expression  (13.44). 

Thus,  if  the  signal-to-noise  ratio  is  high,  but  does  not  equal  infinity, 
distribution  P  (  X  )  is  determined  from  formula  (13.45)  rather  than  (13.44). 

We  will  compare  expression  (13.45)  obtained  for  a  random  initial  phase  signal 
with  the  corresponding  expression  for  a  precisely-known  signal. 

Given  a  high  signal-to-noise  ratio  and  a  precisely -known  signal,  relationships 
(6.12)  are  valid,  i.  e.. 


where 


(6.12) 


Here,  x  —  normalized  value  of  the  measured  parameter,  i.  e., 


t  =  t#(l  +*«»), 


(13.46) 


where 


—  !<•*<  I. 


Replacing  variable  x  with  X  in  (6.12),  in  accordance  with  expressions  (8.1), 
(8.2),  and  (13.46),  we  will  obtain 


Pv  (*)  -  yjf  e"*’  <,-T*  »>\ 


where 


(13.47) 


Initially, 
will  depend  on 


we  will  examine  that  signal  u  ^ 
X  i  i«  6*  ) 


(t)  for  which  only  the  envelope 


«,(/)  =a(/ —  t)cos(o>/ -|-(p),  where  )  r,;  /225 

(13.48) 


«»(/)=  0,  outside  these  limits 

Substituting  (13.48)  into  (13.47)  and  assuming  that  U  T  ^1,  we  will  obtain 


where 


(13.49) 


Comparing  this  result  with  corresponding  formulas  (13.42)  and  (13.45)  obtained 
above  for  an  identical  signal,  but  which  has  random  phase  ,  we  will  be  convinced 
of  their  complete  coincidence. 


Consequently,  if  measured  parameter  X  is  contained  only  in  the  signal  envelope 
and  the  signa 1- to-no ia e  ratio  is  high,  parameter  measurement  accuracy  will  not 
depend  on  whether  signal  initial  phase  is  precisely  known  or  random.  Therefore, 
formulas  (6.16)  and  (6.17)  are  valid  for  the  mean  square  of  the  measurement  error 
and  it  is  possible  to  assume: 


=  (*  X9 

(3tj»  -  (t— -m* 


(13.50) 


It  follows  from  relationship  (13.46)  and  Figure  13.1  that,  when  t„  <^T, 


mi  *  5  must  be  the  case;  here 


(At)»fls !  — 6*. 

4 


(13.50a) 


If  envelope  a(t)  is  described  by  formula  (6.35),  then  formula  (6.40)  is  valid 
for  ft*  ,  to  wit 


6*  _  jW> 


(6.40) 


We  now  will  examine  a  signal  of  the  type 


ut  (/)=*<»(/  —  T)cosl<a(/— T)+«p|,  where  + 


(13.51) 


Mt  (/)  =0,  outside  these  limits 


Measured  parameter  X  of  such  a  signal  modulates  both  signal  amplitude, 
as  well  as  its  phase. 

If  initial  phase  is  random  and  equally  probable  in  the  0  —  2  7T  range, 
then,  instead  of  (13.51),  it  is  possible  to  write 


«t(0=o(/— t)cos(w/-(  (p,),  where  t+t„, 


(13.52) 


where  magnitude  s  LJ  T  +  (p  also  is  random  and  equally  probable  /226 
in  the  0  —  2  7T  range. 

Consequently,  if  phase  ^  ia  random  and  equally  probable  in  the  0—  2jt  range, 
a  type  (13.51)  signal  does  not  differ  at  all  from  the  type  (13.48)  signal  examined 
above.  Therefore,  it  remains  to  examine  a  case  when  a  type  (13.51)  is  precisely 
known  (with  the  exception  of  %  ). 


Here,  substituting  (13.52)  into  (13.47)  and  assuming  U  T  1,  we  will 


obtain 


where 


(13.53) 


In  accordance  with  formula  (13.50),  the  mean  square  of  the  error  equals 

W  “£;•  (13.54) 

It  follows  from  formulas  (13.53)  and  (13.54)  that,  generally  speaking,  it 
is  possible  by  increasing  frequency  l J  to  obtain  a  measurement  error  as  slight 
as  desired.  However,  realization  of  such  high  accuracy  may  turn  out  to  be  very 
difficult  for  the  following  reasons: 

1.  Initial  phase  ^  must  be  precisely  known  at  the  point  of  reception,  which 
essentially  does  not  occur  in  a  majority  of  cases. 

2.  Increased  measurement  accuracy  obtained  by  increasing  LJ  is  achieved 
without  changing  signal  energy  or  broadening  its  spectrum.  But,  Kotel'nikov 
demonstrated  (see  §  6.3)  that,  in  this  case,  the  minimum  signal-to-noise  ratio 

at  which  the  obtained  results  still  remain  valid  increases.  Therefore,  the  decrease 
in  measurement  error  due  to  an  increase  in  frequency  U  (i.  e. ,  due  to  decrease 
in  the  signal  rf  occupation  period)  may  be  realized  only  for  very  high  input  signal 
levels  or  for  a  very  low  noise  level. 

For  these  reasons,  in  a  majority  of  essentially  interesting  cases,  minimum 
permissible  X  measurement  error  is  determined  from  formulas  (13.49)  and  (13.50), 
namely: 


(Srp 


N, 


?m' 


dt 


(13.55) 
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If  the  pulse  envelope  is  described  by  formula  (6.35),  i.  e. ,  the  pulse  has 
a  rectangular  apectrum,  bounded  by  band  w  —  ('"»  -  **)  —  (m04*S2)  t  then,  in  accordance 
with  (13.50a)  and  (6.40),  the  result  is 


3V, 

2QO*“ 


(13.56) 


formulas  (13.55)  and  (13.56)  are  equally  valid  for  precisely-known  /227 
and  for  randan  initial  phase  signals  when  the  signal-to-noise  ratio  is  high. 


Sonetines,  it  is  more  convenient  to  write  formula  (13.55)  in  the  following 

fora: 


(STji - - 

j  «*|G«  (/»)  I*  dut 

—  m 

where  G„  (/«»)  —  envelope  a(t)  complex  spectrum  (Fourier  transform). 
Since  signal  energy  equals 


(13.57) 


=  ~  J  a*  (/)  rff  =  j-  |  Gn  (/«)  |*  dw, 


(13.58) 


then  it  is  possible  to  write  formula  (13.57)  in  the  following  form: 


The  result  for  the  signal  examined  above  with  a  rectangular  envelope  frequency 
spectrum  in  the  — 1 Q  to  ii  range  is 


f-T0’' 


(13.61) 


and  formula  (13.59)  is  transformed  into  formula  (13.56). 


13.3  Measurement  of  Random-Phase  Signal  Frequency 


In  observation  cycle  (0,  T),  let  the  signal  have  the  form 


ue  (0  “  a  (/)  cos  («.>/  -f  <p)r. 


(13.62) 


where  a(t)  --  known  time  function?  ^  —  measured  parameter  with  a  priori  distri¬ 
bution  P(  w  );  Cp  —  parasitic  random  parameter  with  distribution  (9.3). 

Accomplishing  insertions  analogous  to  those  made  in  the  preceding  section, 
instead  of  (13.25)— (13.27),  we  will  obtain  the  following  relationships: 


where  —  normalizing  constant; 


(13.63) 


M,H  »/*,•  +  >'•* : 
r 

Xt-lyU)a(t)Qosatdt; 

0 

T 

K*  —  fy(t)a  (0  sin  «»/  dt, 
o 


(13.64) 


or  M2(  U  )  is  the  envelope  with  respect  to  of  oscillstion 


5,  (w, (p)  =*  (/) <J (/) cos (( +  f ) dt. 


(13.65) 
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It  follows  from  (13.65)  that  magnitude  M^(  LJ  )  for  every  value  of  frequency 
^  is  proportional  to  the  value  of  the  envelope  (at  moment  t  :  T)  of  the  oscillation 
at  output  of  an  optimum  linear  filter  matched  with  signal 

(/)  =  a  (t)  cos  («/  +  if). 

Absence  of  data  on  signal  phase  £p>  is  no  obstacle  to  design  of  this  filter  since 
the  filter  at  output  must  supply  the  correct  value  only  of  the  rf  oscillation 
envelope.  Therefore,  it  is  possible  to  assume  any  phase  (£)  ,  such  as  one  equalling 
zero,  during  filter  design. 

The  main  difficulty  is  that  function  P  (  \J  )  and,  consequently,  function 
U  )  as  well,  must  be  computed  not  only  for  one  LJ  value,  but  for  an  infinite 
multitude  of  LJ  values  ranging  from  2 nf„n„  to  2n/MaHC  ,  where  fMKH  and 

Ammo  —  limits  of  possible  frequency  f  changes  in  the  anticipated  signal 

(f  =  U/2  7 T  ). 

Since  optimum  filter  structure  significantly  will  depend  on  signal  carrier 
frequency  f,  then,  strictly  speaking,  a  set  of  an  infinite  number  of  such  filters 
is  required.  (If  realization  y(t)  may  be  written  and  retained  for  a  long  time, 
then  it  is  possible  to  use  one  filter  with  tunable  center  frequency  f  of  this 
filter' 8  bandwidth.) 

No  similar  difficulty  arises  when  measuring  signal  amplitude  or  its  lag 
X  since  optimum  linear  filter  structure  will  not  depend  on  signal  anplitude 
a  or  its  time  of  arrival  •  The  aforementioned  difficulty  occurs  in  a  case 
of  frequency  measurement  and,  to  overcome  this,  anticipated  signal  pass-band 
/nm  —  /micemu8t  divided  into  n  sections  of  finite  width 


■  /nine  —fnim 


where  Af  —  magnitude  less  than  the  width  of  the  basic  portion  of  the  anticipated 
signal  spectrum.  Here,  instead  of  an  infinite  number  of  filters,  it  suffices 
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to  have  a  finite  number  n  of  such  filters,  which  makes  it  possible  to  /229 

compute  a  sufficient  number  of  points  P  (  LJ  .),  P  (  U  9),  .  .  P  (  )  to 

y  i  y  *  yn 

plot  desired  curve  P^(  LJ  ). 

We  now  will  examine  the  problem  of  ultimate  frequency  measurement  precision 
when  the  signal- to-noise  ratio  is  high.  Here,  as  in  previous  cases,  we  will  assume 
that  a  priori  distribution  P(  U  )  of  the  measured  parameter  is  uniform.  Then, 
analogous  to  (13.32)  we  obtain 


?*.<*» 

P,  t  (13.66) 

where  k2  —  normalizing  constant,  while  M2(  1*7  )  is  the  envelope  with  respect 
to  C|?  of  function  ^  (  {J  ,  ^>),  determined  from  expression  (13.65). 

It  is  possible,  given  a  high  signal- to-noise  ratio,  to  assume  in  expression 
(13.65)  in  the  first  approximation 

y  (*)-«(/)  cos 

where  ^ Q  and  ^PQ  —  true  values  of  signal  frequency  and  initial  phase  occurring 
during  a  given  observation  cycle  (0,  T). 

Then,  from  (13.65),  we  will  obtain 

r 

l»  (<*».  9)  =»  ^  «*  (/)  cos  («>,  t  +  q>»)  cos  (w/  +  q>)  dt. 


Evidently,  without  disrupting  the 
«  0  in  this  expression;  then 


generality,  it  is  possible  to  assume 


5i  («>,  f)  aXcosf — Y sirup 


M*  (<»»)  =  |  /Y*-f  >‘4. 


(13.67) 


where 


X  =  f  a1  (/)  cos  m„  t  cos  Ml  dt\ 
o 

T 

Y  —  J  a*  (/)  cos  m0 1  sin  m/  dl . 
o 


(13.68) 


Ue  will  designate 


Aw  sat  10  —  (<)| 


(13.69) 


and  we  will  assume  that 


and,  consequently,  and 

Ao>r <  I. 


(13.70a) 


and  that 


'o,  T >>  ]. 


(13.70b) 


It  follows  from  (13.70a)  that,  in  formulas  (13.68),  it  is  possible  to  assume  /230 


cos  Aw/  »  1  — 
sin  Aw/  m  Aw/. 


(13.71) 


Considering  relationships  (13.69),  (13.70b),  and  (13.71),  it  is  possible 


’  a1*.’'-  a*'  -  •  a'*  */  '*.  ** 


to  use  relatively-urwieldly  but  uncomplicated  transformations  to  reduce  formulas 
(13.67)  and  (13.68)  to  the  following  form: 

M*  (»)  =  <?—  (»■» — <•>„)*.  q3/ 


where 


Q  =  -15a»(/)d/ 


is  relative  signal  energy ,  while 


Substituting  expression  (13.72)  into  (13.66),  we  will  obtain 


/>,<•> 


where  —  normalizing  constant. 

For  the  identical  reasons  which  led  to  replacement  of  expression  (13.44) 
by  (13.45),  it  is  more  correct  to  replsce  true  signal  frequency  value  U Q  in 
(13.75)  by  its  most-probable  value  ,  i.  e.,  to  assume 


Pf  (»)  — 


Considering  the  normality  condition,  i.  e.,  assuming 


k ,  "T"*  e“  ■)*  da  *  *,  $  e-  7^, (•"**»  »)’  da  —  I , 


we  will  obtain 


'•.<-> -Kg 


Consequently,  given  a  high  signs 1-to-no is e  ratio,  distribution  Py(  U  ), 
just  like  Py(a)  and  P^(  T  )  are  described  by  a  gaussian  curve.  Therefore,  the 
mean  square  of  the  frequency  measurement  error  equals 


where  b2  is  determined  by  formulas  (13.73)  and  (13.74). 


It  was  noted  in  Chapter  6  in  the  examination  of  the  problem  of  measuring 
the  frequency  of  a  precisely-known  signal  that  measurement  error  will  depend  on 
moment  t^,  at  which  observation  begins.  Therefore,  in  order  to  clarify  whether 
or  not  an  analogous  phenomenon  exists  in  the  case  under  examination,  we  must  in 
expressions  (13.73)  and  (13.74)  assume  the  observation  cycle  equals  t^  —  t^  +  T 
rather  than  (0,  T).  Then,  we  will  obtain 


Let  signal  amplitude  in  the  observation  cycle  be  constant,  i.  e.,  a(t)  s  aQ; 
then,  from  (13.77)  and  (13.78),  we  will  obtain 


(SS)*=l™l.=*JL'  (13.79) 

a,*r*  QT*  ‘ 

If  normalized  message  x  is  used,  as  was  the  case  in  Chapter  6,  then,  instead 
of  (13.79),  we  will  obtain 


(13.80) 


It  follows  from  formulas  (13.79)  and  (13.80)  that,  given  a  signal  with  random 
equiprobable  (ranging  from  0  —  2  7T  )  initial  phase,  the  frequency  measurement 
error  will  not  depend  on  moment  t^  when  observation  begins. 


Comparison  of  formulas  (13.80)  and  6.28b)  demonstrates  that  resultant  error 
during  measurement  of  a  random-phase  signal  frequency  is  identical  to  that  obtained 
in  the  worst  case  (i.  e.,  where  t^  a  T/2)  for  a  precisely-known  signal. 

This  result  is  understand^) le  since,  in  Chapter  6,  it  was  indicated  that, 
in  the  case  of  a  precisely-known  signal,  a  decrease  in  measurement  error  obtained 
when  t^  is  increased  may  be  realized  only  if  the  initial  phase  of  signal  rf  occupation 
is  precisely  known  at  the  point  of  reception. 


13.4  Simultaneous  Signal  Detection  and  Measurement  of  Its  Psrameters 

It  was  assumed  in  previous  sections  of  this  chapter  that  realization  y(t), 
the  basis  for  analysis  of  which  signal  parameters  (amplitude,  frequency,  and  so 
forth)  must  be  reproduced,  must  contain  the  signal.  But,  in  radar  and  in  certain 
other  instances,  there  often  is  a  requirement  to  use  analysis  of  this  same  realization 
y(t)  to  solve  the  problem  of  both  signal  parameter  measurement  and  its  /232 
detection  when  it  is  not  known  beforehand  whether  or  not  a  given  realization 
contains  signal-plus-no iae  or  just  noise. 


This  problem  of  simultaneous  signal  detection  and  measurement  of  its  analog 


parameters  to  a  significant  degree  is  similar  to  the  problem  of  simultaneous 
detection  and  recognition  of  m  equi probable  orthogonal  signals  examined  in  Chapter 
11. 


Actually,  let  the  problem  be,  for  instance,  simultaneous,  i.  e.,  with  respect 
to  the  same  realization  y(t),  determination  whether  or  not  a  given  realization 
contains  a  signal  pulse  and,  if  so,  what  this  signal's  lag  7*  (moment  of  arrival) 
is.  The  measured  parameter's  a  priori  distribution  equals  P (  T  )  and  it  is  known 
that  T  ranges  from  TMim  —  TMiNe .  The  anticipated  signal's  duration  is  t„  . 

Parameter  T  is  an  analog  magnitude  but,  in  the  first  approximation,  it 
is  possible  to  assume  that  T  may  accept  only  discrete  values  with  interval 

At  as  r,  . 

Number  m  of  such  discrete  values  equals 

r 

m—  — , 

T»  (13.81) 

where  T  —  observation  cycle,  which  must  encompass  all  possible  signal  pulse 
positions.  Therefore 

^  =  (th«hc — *f  t,, 


and 


m  -=\  IsiSLU-Isg? . 
t« 


(13.82) 


Since,  given  such  assumptions,  possible  pulse  signals  do  not  overlap  over 
time,  then  they  are  orthogonal  and  the  parameter  T  measurement  problem  will  boil 
down  to  recognition  of  which  of  m  possible  orthogonal  signals  will  be  contained 
in  given  realization  y(t).  Consequently,  the  aforementioned  parameter  digitization 
reduces  the  problem  of  simultaneous  signal  detection  and  measurement  of  its  parameter 
T  to  a  problem  of  simultaneous  detection  and  recognition  of  m  orthogonal  signals. 


Parameter  T  in  the  general  case  may  have  irregular  a  priori  distribution 
P (  T  ).  But,  as  was  mentioned  often  above  and  will  be  demonstrated  in  Chapter 
19,  in  the  majority  of  real  cases  and,  in  particular,  given  high  parameter  measurement 
accuracy,  it  is  possible  to  assume  that  distribution  P (  T  )  is  uniform.  Here, 
evidently,  it  is  possible  accordingly  to  assume  that  all  m  orthogonal  signal  values 
are  equally  probable. 

Thus,  the  task  of  simultaneous  signal  parameter  detection  and  measurement 
boils  down  in  the  first  approximation  to  the  problem  of  simultaneous  detection 
and  recognition  of  m  equally-probable  orthogonal  signals.  This  problem  was  examined 
in  detail  in  Chapters  11  and  12,  with  the  following  basic  results  obtained: 

1.  In  the  general  case,  optimum  signal  detection  without  its  recognition  /233 
requires  less  signal  energy  and  another  optimum  system  structure  than  does  simul¬ 
taneous  signal  detection  and  recognition.  Therefore,  in  the  general  case,  if 
analysis  of  given  realization  y(t)  is  the  foundation  for  obtaining  the  most-reliable 
results  in  the  case  of  both  detection  and  recognition,  there  must  be  two  receiving 
systems,  an  optimum  detection  (without  recognition)  and  an  optimum  recognition 
(without  detection)  system.  Here,  the  results  of  the  second  system  are  considered 
only  in  those  cases  when  the  first  system  establishes,  with  sufficient  reliability, 
that  a  signal  is  present  (some  elements  of  both  systems  may  be  common,  but  output 
elements  must  be  different). 

However,  when  there  are  high  requirements  for  detection  reliability  (there 
can  be  no  highly-reliable  recognition  without  highly-reliable  detection),  simultaneous 
detection  and  recognition  requires  only  slightly-greater  signal  energy  than  is 
the  case  for  detection  without  recognition.  Therefore,  when  detection  and  recognition 
reliability  requirements  are  high,  it  is  inadvisible  to  create  a  separate  system 
for  optimum  detection  (without  recognition). 

2.  A  change  in  the  number  m  of  orthogonal  signals  by  a  factor  of  10—100 
has  virtually  no  impact  on  requisite  signal  energy  if  the  permissible  false-alarm 
probability  is  slight  (  />„T<  10"3)  or  if  m  ^104  [see  formulas  (11.27)— (11.29)]. 

3.  If  />4lt<0,l  (which  usually  is  the  case),  energy  required  for  detection 


with  recognition  of  m  equally-probable  orthogonal  signals  may  be  determined  from 
simple  binary  detection  formulas,  if  in  them  is  replaced  by  P„T !m  . 

s' 

4.  If  P„  T  ■<  0,1  and  Pa  T  <  0,1  Pop (which  usually  is  the  case),  then  the  result 
is 

Pmen  ^  Pnp< 


i.  e.,  recognition  error  probability  (given  that  some  sort  of  signal  is  present) 
equals  miss  probability. 

Considering  the  above  analogy  between  recognition  of  orthogonal  signals  and 
measurement  of  analog  parameter  T  ,  it  is  possible  to  formulate  the  following 
postulations  valid  for  simultaneous  signal  detection  and  measurement  of  its  analog 
parameter  T  : 

1'  .  In  the  general  case,  two  optimum  systems  are  needed  for  given  realization 
y(t)  for  optimum  solution  of  the  problem  of  signal  detection  and  measurement  of 
its  parameter. 

The  first  must  provide  optimum  solution  of  the  detection  (without  recognition) 
problem,  while  the  second  accomplishes  optimum  parameter  measurement  (without 
detection).  Here,  second  system  data  are  considered  only  in  those  cases  when 
the  first  system  establishes  with  sufficient  reliability  that  a  signal  is  present. 
(Some  units  in  both  systems  are  common,  but  output  units  must  be  separate). 

However,  given  high  requirements  for  detection  reliability  (but,  there  /234 
may  be  no  highly-reliable  parameter  measurement  without  highly-reliable  detection), 
simultaneous  parameter  detection  and  measurement  provide  the  same  detection  reli¬ 
ability  and  require  only  slightly-more  signal  energy  than  is  the  case  for  parameter 
detection  without  measurement. 

Therefore,  it  is  inadvisible  to  create  a  separate  system  for  optimum  detection 
(without  parameter  measurement),  given  high  requirements  for  parameter  detection 
and  measurement  reliability. 
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2'.  Given  high  requirements  for  parameter  measurement  accuracy,  the  optimum 
system  is  the  one  founded  on  the  principle  of  greatest  inverse  probability  density 
P  (  T  ),  i.  e.,  the  system  supplying  that  parameter  value  T  ~t„h,  for  which 
density  P  (  X  )  is  maximum. 

t 

As  noted  earlier  and  is  demonstrated  in  Chapter  19,  this  system  is  optimum 
in  the  sense  that  it  provides  the  minimum  mean  square  measurement  error  and  in 
the  sense  of  a  broad  class  of  other  optimizations. 

It  follows  from  the  above  point  that  it  is  inadvisible  to  create  a  separate 
system  for  optimum  detection  (without  parameter  measurement),  given  high  requirements 
for  parameter  detection  and  measurement  reliability.  Therefore,  an  optimum  simul¬ 
taneous  parameter  detection  and  measurement  system  must  operate  in  the  following 
manner  (similar  to  the  action  of  the  Figure  11.2  optimum  system  for  detection 
with  recognition  of  m  orthogonal  signals).  It  must  determine  the  greatest  value 
P,  (t* „)  of  the  measured  parameter  inverse  probability  density  and  compare  it 
with  some  threshold  U  selected  in  accordance  with  permissible  false-alarm  probability 
Pat  •  If  it  turns  out  that 


P|  (Tt> «)  ^ 

then  the  decision  is  that  no  signal  is  present;  if 

then  the  decision  is  that  a  signal  is  present  at  input  and  that  T,a  is  the  desired 
value  of  measured  parameter  T  .  Thus,  for  example,  the  system  for  optimum 
measurement  depicted  in  Figure  13.2  will  supply  optimum  signal  detection  and 
measurement,  if  one  assumes  that  a  signal  is  present  only  in  those  cases  when 
maximum  detector  output  voltage  value  (Figure  13.3)  exceeds  some  pre¬ 

determined  threshold  UQ.  The  methodology  for  measurement  of  X  remains  as  before, 
i.  e.,  it  is  assumed: 


*np— V 
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3*.  It  is  permissible  to  replace  a  real  signal  with  analog  parameter  T 
of  a  system  of  m  equally-probable  orthogonal  signals  to  determine  the  signal  energy 
required  for  reliable  signal  detection  in  a  system  of  simultaneous  signal  /235 
detection  and  measurement  of  its  parameter.  Magnitude  m  may  be  determined  approx¬ 
imately  from  formula  (13.82). 

The  circumstance  that  magnitude  m  may  be  determined  only  approximately  (since 
m  orthogonal  signals  only  are  approximately  equivalent  to  one  signal  with  analog 
parameter  T  ),  does  not  play  a  large  role  since,  as  noted  above  (in  point  2), 
in  a  majority  of  cases,  an  error  in  determination  of  m  by  a  factor  of  10  or  even 
of  100  is  permissible. 

4'.  If  Pn  t  <‘0,1  (which  usually  is  the  case),  then  the  energy  required  for 
signal  detection  with  given  error  probabilities  Pn  t  and  Pn?  may  be  determined 
from  simple  binary  detection  formulas  if  P„  r  in  them  is  replaced  by  Pn  r'mm 

5'.  It  follows  from  the  above  points  that,  during  simultaneous  signal  detection 
and  measurement  of  its  parameter,  the  energy  required  to  provide  given  detection 
error  probabilities  P„  T  and  P„p  (given  alight  values  for  these  probabilities) 
may  be  computed  from  formulas  presented  in  previous  chapters  for  a  case  of  simul¬ 
taneous  detection  and  recognition  of  m  equiprobable  orthogonal  signals. 

Energy  required  to  provide  the  given  parameter  measurement  mean  square  error 
is  determined  from  formulas  presented  in  this  chapter  [from  formula  (6.40),  for 
example] . 

In  individual  cases,  requisite  energy  will  be  determined  in  the  final  analysis 
with  permissible  detection  error  probability  values  (  Pn  t  and  P np  ),  while, 
in  other  cases,  with  the  permissible  parameter  measurement  error. 

All  the  above  discussions  for  simplicity  were  applied  to  a  case  of  measuring 
lag  T  .  But,  it  is  evident  that  they  may  be  used  to  measure  other  signal 
parameters  as  well.  For  example,  if  the  requirement  is  to  carry  out  simultaneous 
detection  of  a  signal  and  measurement  of  its  frequency,  then  the  only  difference 
will  be  that  number  m  oi  the  equivalent  orthogonal  signals  must  be  determined, 
not  from  formula  (13.82),  but  from  the  formula 
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(13.83) 


_  i  i  /n*Ke — futm 
m_!  } - - - , 

where  Af  —  width  of  the  basic  part  of  the  signal  spectrum  (i.  e.,  of  that  part 
of  the  spectrum  in  which  the  main  part  of  the  signal  energy  is  concentrated,  for 
example,  70 — 80S  of  this  energy),  while  /*BII  and  —  of  possible  measured 

frequency  f  change. 

We  will  examine  as  our  example  radar  detection  and  recognition  of  targets 
located  at  different  azimuths  and  ranges.  Azimuth  6  and  lag  T  (proportional 
to  range  r)  of  each  target  are  analog  parameters,  which  may  change  in  the  range 

and  T,nm_  .  However,  when  determining  the  energy  required 

for  detection  of  a  target  with  given  error  probabilities  P*  T  and  P „„  ,  we  will 
assume  that  the  signal  reflected  from  each  target  is  equivalent  to  a  signal  which 
may  have  one  of  m  equally -probable  orthogonal  values,  where 

/236 


m=ml  m,; 

flume — A, 


mt »  1  + 


A8 


fH,  »  1  +  . 

U 


(13.84) 


Here,  A  6  —  width  of  the  radiation  pattern  with  respect  to  azimuth,  while 

tv  —  pulse  signal  duration. 

Actually,  for  each  position,  the  radiation  patterns  of  signal  pulses  shifted 
relative  to  one  another  at  intervals  tb,  2t,  and  so  on  do  not  overlap  over  time. 
Pulses  corresponding  to  radiation  pattern  positions  shifted  relative  to  one  another 
at  angle  A 6?  also  do  not  overlap  over  time.  Therefore,  none  of  the  m  possible 
signal  puls  i,  where  m  is  determined  from  formula  (13.84),  overlap  over  time  and, 
consequently,  are  mutually  orthogonal. 

Formula  (13.84)  applies  to  a  case  when  the  frequency  of  the  reflected  signal 
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is  precisely  known,  which  ususlly  is  the  case  only  for  a  stationary  target.  If 
the  target  moves  and  the  frequency  of  the  reflected  pulse  due  to  the  Doppler  effect 
may  be  found  equally  probably  in  the  range  nr* _ then,  instead  of  (13.84), 
it  should  be  assumed  that 


m  nti  m,  mt. 


(13.85a) 


where  m^  and  m2,  as  usual,  are  determined  from  formula  (13.84),  while 


i, «  I  f 

N 


(13.85b) 


where  Af  —  width  of  the  basic  part  of  the  pulse  spectrum. 

Af  and  X  are  linked  by  the  following  known  relationship  for  a  pulse  with 
sinusoidal  rf  occupation: 


tM  '  (13.85c) 

It  follows  from  formulas  (13.84)  and  (13.85)  that,  for  short-duration  pulses, 
the  result  is  m2 1,  while  m^  **  1,  i.  e.,  signal  frequency  irregularity  does 
not  increase  significantly  magnitude  m  and,  consequently,  there  is  no  need  for 
a  significant  increase  in  the  signal  energy  required  for  reliable  detection. 

Given  long-duration  pulses,  on  the  other  hand,  1  and  the  irregularity  of 

the  frequency  of  the  signal  reflected  from  the  target  significantly  increases  m. 

For  example,  let  2  ms;  %  *  2  us;  -9„„  =  360°; 

A  6  s  1°;  s  20  kHz;  Af  s  1  MHz.  Then,  in  accordance  with  formulas 

(13.84)  and  (13.85),  we  will  obtain 

m,  *  360*  mt  «  1  000,  rt «  1 


m  »  m,mt- 1  *  3,6- 10*. 
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If  signal  amplitude  is  assumed  to  be  known,  then  signal  energy  required  /237 
for  detection  with  error  probabilities  P  *  r  and  Pm<  may  be  determined  from 
formula  (11.28),  i.  e., 


£  "  (  V'nFTr  +  ln  m  +  K'n  KT  - 1  V . 


(11.28) 


This  formula  applies  to  a  case  when  the  reflected  signal  comprises  one  pulse 
or  n  coherent  pulses. 

If  the  signal  takes  the  form  of  a  packet  of  n  incoherent  pulses,  then  the 
loss  in  requisite  energy  resulting  from  this  incoherence  may  be  determined  from 
formula  (12.60)  or  from  the  Figure  12.5  curves. 

Let  P.t  t  s  10‘5  and  P„p  =  lO”1.  Then,  from  formula  (11.28),  we  will  find 
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We  will  explain  how  imprecision  in  determination  of  the  number  m  of  equivalent 
orthogonal  signals  may  impact  upon  ratio  Q/Ng  magnitude.  We  will  assume  (with 
a  large  allowance)  that  an  error  by  a  factor  of  100  on  either  side  of  the  true 
value  may  be  permitted,  i.  e.,  in  actuality,  m  may  equal  3.6  x  10^  or  3.6  x  10' . 
Then,  from  formula  (11.28),  we  will  obtain,  respectively 

■j?"  **29,  or  ~  "40, 

i.  e.,  in  this  example,  an  error  by  a  factor  of  100  in  either  direction  in  the 

determination  of  m  will  lead  to  an  error  of  less  than  0.7  dB  in  determination 

of  threshold  ratio  Q/N  . 

o 

In  the  case  of  a  fluctuating  signal,  formula  (11.29)  should  be  used  instead 
of  (11.28),  i.  e.,  one  should  assums 
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Where  Pnp~ 0,1  P„T  =  10-8  and  m  =  3.6  x  10^,  the  result  is 

^  =  240. 

In  this  case,  a  change  (decrease)  in  m  by  a  factor  of  100  also  causes  only 
a  relatively-slight  change  QcP/ Nt  of  less  than  0.9  dB  in  threshold  ratio. 

Assuming  that  signal  amplitude  and  frequency  are  precisely  known,  if  ratio 
Q/Nq  is  sufficiently  high,  the  parameter  T  measurement  error  may  be  determined 
from  formula  (13.56). 


MATHEMATICAL  METHODS  OF  STATISTICS  FOR  INVESTIGATION  OF  OPTIMUM  SIGNAL  RECEPTION 
CHAPTER  FOURTEEN 

SIGNAL  DETECTION  AND  RECOGNITION  ANALYSIS  USING  STATISTICAL  HYPOTHESIS  TESTING 
APPROACHES 

14.X  General  Comments 

In  preceding  chapters,  optimum  receiver  analysis  mainly  used  the  maximum 
inverse  probability  approach.  However,  a  broad  mechanism  developed  in  contemporary 
mathematical  statistics  makes  it  possible  significantly  to  expand  and  intensify 
the  theory  of  optimum  reception  methods  and,  in  particular,  to  evaluate  the  appli¬ 
cability  range  of  the  maximum  inverse  probability  approach.  At  present,  mathematical 
statistics  is  one  of  the  most-rapidly  developing  branches  of  mathematics  and  the 
results  it  supplies  will  find  ever-expanding  utility  in  the  theory  of  optimum 
reception  methods. 

The  main  task  in  mathematical  statistics  is  decision-making  based  on  results 
of  observing  random  magnitudes  (or  random  functions).  It  is  assumed  that  the 
result  of  observing  some  random  magnitude  (or  random  function)  is  a  system  (y^, 
yj,  •  •  *,  yn)  of  the  values  of  this  magnitude  (or  function). 


Each  observed  value  yi  (Figure  14.1)  is  called  a  sample  value,  while  the 
entire  system  (y^,  .  .  .,  yp)  is  called  a  sample.  The  number  n  of  the  sample 


Figure  14.1  Figure  14.7 

values  contained  in  a  given  sample  is  called  the  sample  size. 

Based  on  a  resultant  sample,  let  the  requirement  be  to  establish  the  lav 
of  distribution  of  the  observed  random  magnitude  (or  random  function).  It  is 
evident  that  the  problem  may  be  solved  precisely  only  when  n  — — ►  •  and,  the 
smaller  the  n,  then,  all  other  conditions  being  equal,  the  less  the  accuracy  in 
representation  of  the  desired  lav  of  distribution. 

In  many  cases,  the  functional  shape  of  the  desired  lav  of  distribution  is 
known  (for  example,  it  is  known  that  this  lav  is  normal)  and  only  one  or  several 
parameters  determining  this  distribution  sre  unknown  (for  instance,  mean  value 
and  dispersion).  Then,  the  problem  boils  dawn  only  to  establishing  these  distribution 
parameters. 

Distribution  parameters  may  be  discrete  or  analog  magnitudes.  If  they  /2J9 
are  discrete  magnitudes,  i.  e.  only  discrete  values  a^,  a^  .  .  ^  aay  be  accepted, 

then,  considering  that,  given  a  finite  sample  size,  it  is  impossible  to  establish 
with  full  precision  which  of  these  values  actually  occurs.  Only  a  hypothesis 
may  be  formed  concerning  this  or  that  parameter  value.  Here,  some  a  posteriori 
probability  of  its  validity  (correctness)  corresponds  to  each  such  hypothesis, 
which  is  called  a  statistical  hypothesis.  Consequently,  establishing  the  value 
of  a  desired  parameter  requires  solution  ("testing")  of  which  of  m  possible  hypotheses 
(a  *  Sj,  a  *  a2,  or  a  s  aR)  is  most  likely  for  a  given  sample.  Thus,  in  s  case 
of  discrete  parameters,  the  task  boils  down  to  testing  statistical  hypotheses. 
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If  desired  parameter  a  is  an  analog  magnitude,  then  the  task  is  to  use  the 

resultant  sample  (y.,  ...»  y  )  to  evaluate  the  magnitude  of  this  parameter  in 

•  ^  ^ 

some  way.  The  result  of  this  evaluation  (in  this  case  a~)  differs  from  the  para¬ 
meter's  true  value  due  to  sample  size  finitude,  i.  e. ,  there  is  a  parameter  mea¬ 
surement  error: 


Aa  =  a*  —a.  (14.1) 

Thus,  depending  on  whether  the  desired  parameter  is  discrete  or  analog,  the 
task  boils  down  to  testing  statistical  hypotheses  or  evaluating  the  distribution 
parameter  (or  several  parameters). 

It  is  not  difficult  to  become  convinced  that  these  tasks  are  equivalent  to 
the  problems  of  detection  and  reproduction  of  a  signal  on  a  noise  background. 
Actually,  detection  or  reproduction  of  a  signal  on  a  noise  background  requires 
observation  of  signal-plus-noise  (or  of  noise  alone)  realization  y(t)  to  detect 
whether  or  not  there  is  a  signal  and  (or)  establishing  the  values  of  certain  signal 
parameters  (amplitude,  delay  time,  and  so  forth). 

In  accordance  with  the  Kotel'nikov  expansion  theorem  (see  §  1.3),  realization 
y(t)  approximately*  is  characterized  by  its  n  values  (y^,  .  .  .,  yR)  taken  /240 
at  intervals  At  =  1/2/,  (Figure  14.2).  Consequently,  the  observation  result 
may  be  sample  (y,  .  .  .,  yR)  with  size 


The  signal-plus-noise  realization  law  of  distribution  will  depend  on  the 
signal  included  within  it  and,  consequently,  on  signal  parameters. 

Therefore,  if  noise  parameters  are  known,  then  the  only  unknowns  are  the 
signal  parameters.  Hare,  finding  (determination  of)  distribution  parameters  fully 
equates  to  determination  of  signal  parameters,  i.  e.,  solution  of  the  signal  detection 
or  reproduction  problem.  Only  tha  following  comments  are  required. 


♦The  error  of  such  an  approximation  is  less,  the  smaller  sample  size  n 


If  sample  size  n  strives  towards  infinity  because  observation  time  T  rises 
without  restraint  [see  formula  (14.2)],  then  the  error  in  detection  or  reproduction 
problem  solution  will  strive  towards  zero  (if  the  measured  parameter  over  time 
T  remains  unchanged). 

If  magnitude  n  strives  towards  infinity  because  At  — ►  0  where  f»  =  const 
and  T  =  const,  then  problem  solution  error  not  only  will  not  strive  towards  zero, 
but  remains  virtually  unchanged  (if  value  n  already  was  sufficiently  high  prior 
to  being  increased).  This  is  because,  in  accordance  with  the  Kotel'nikov  expansion 
theorem,  function  y(t)  essentially  completely  is  characterized  by  its  n  discrete 
values  if  n  is  sufficiently  high  and  spectral  energy  outside  the  frequency  /• 
range  is  infinitesimally  small.  This  denotes  that  a  decrease  in  interval  At  among 
sample  values  (Figure  14.2)  from  1/2/,  to  zero  essentially  provides  no  new  infor¬ 
mation  at  all  about  realization  y(t)  and,  consequently,  may  not  increase  assigned 
problem  solution  accuracy. 

If,  on  the  other  hand,  interval  At  is  increased  in  comparison  with  1/2/,  , 
then,  where  T  =  const,  some  of  the  information  on  function  y(t)  will  be  lost  and 
problem  solution  accuracy  decreases. 

A  sample  where  n  — ►  00  and  T  =  const  i  oo  >  (i.  e. ,  At - ►  0  where  T  = 

const)  is  analog  rather  than  discrete,  which  occurs  when  At  i  0. 

It  follows  from  what  has  been  stated  that  an  analog  sample  may  not  provide 
significantly-greater  problem-solving  accuracy  than  discrete  sampling  does  if, 
for  the  discrete  sample 


(14.3) 


Therefore,  when  condition  (14.3)  is  met,  it  is  immaterial  in  principle  whether 
a  discrete  or  analog  sample  is  used.  Here,  sample  type  is  selected  only  from 
the  point  of  view  of  simplicity  in  mathematical  analysis  or  design  accomplishment. 


It  is  evident  also  that,  if  some  sort  of  formula  is  obtained  in  the  /241 


form  of  a  sum  for  the  discrete  sample,  then  the  corresponding  result  for  an  analog 
sample  may  be  obtained  if,  in  this  formula,  one  assumes: 


or 


(14.4) 


ii— where  7*— const,  j 

Here,  the  sum  is  transformed  into  an  integral. 

Considering  these  comments,  finding  distribution  parameters  for  given  sample 
(yx,  .  .  .,  yn)  is  equivalent  to  detection  or  reproduction  of  a  signal  (or  of 
its  parameters)  for  given  realization  y(t)  [at  interval  (0,  T)]. 

The  task  for  signal  detection  and  recognition  is  to  differentiate  among  discrete 
signal  types.  Therefore,  this  problem  equates  to  testing  statis'  ical  hypotheses. 
Measurement  of  analog  signal  parameters  is  equivalent  to  estimating  distribution 
parameters. 

Use  of  the  statistical  hypotheai8  testing  approach  for  signal  detection  and 
recognition  problems  is  examined  in  this  and  subsequent  chapters  (Chapters  14 
and  15).  Chapter  16  is  devoted  to  use  of  the  distribution  parameter  estimate 
approach  to  analog  signal  parameter  reproduction  problems. 

A  more-general  theory  of  statistical  solutions  is  examined  in  Chapters  17 
and  18.  This  theory,  first,  makes  it  possible  to  approach  statistical  hypothesis 
testing  and  distribution  parameter  estimation  from  a  common  and  more-general  point 
of  view.  Second,  it  provides  the  opportunity  to  solve  additional,  more  complex, 
statistical  problems. 

14.2  Signal  Detection  and  Recognition  to  Teat  Statistical  Hypotheses 

As  noted  in  §  8.1,  various  types  of  detection  are  possible. 

Initially,  we  will  examine  simple  binary  detection  of  a  precisely-known  signal. 


Let  the  amplitude  of  this  signal  equal  sq.  Then,  the  detection  problem  boils 
down  to  watsblishing,  using  observation  of  realization  y(t)  [during  interval  (0, 

T)]  as  the  basis,  whether  this  realization  contains  a  signal  or  corresponds  only 
to  noise  or,  which  is  the  same  thing,  to  establish  which  of  two  possible  signal 
amplitude  values  occurred:  as  0  or  a  s  aQ. 

This  denotes  that  hypothesis  H  must  be  tested  (no  signal,  a  s  0)  relative 
to  alternative  (incompatible)  hypothesis  (signal,  a  s  aQ). 

A  signal  has  not  one,  but  several,  possible  non-zero  values  u^,  u^,  .  . 
uffl  in  complex  binary  detection  and  the  detection  problem  boils  down  to  testing 
hypothesis  Hq  (no  signal  of  any  kind)  relative  to  alternative  hypothesis  (one 

of  m  possible  signals  is  present).  But,  here,  as  opposed  to  simple  binary  /242 
detection,  hypothesis  is  called  complex. 

In  detection  with  recognition  of  m  non-zero  signals  u^,  Ug,  .  .  .,  uffl,  m  +  1 

hypotheses  will  be  subject  to  testing — Hq  (no  signal) ,  (signal  is  present), 

Hj  (signal  U2  is  present),  .  .  . ,  (signal  um  is  present).  In  this  case,  the 

zero  hypothesis  (Hq)  is  in  contrast  to  m  alternative  non-zero  hypotheses  (H^, 

.  .  .,  H  ) .  Therefore,  this  often  is  called  m-altemative  detection, 
m 

It  was  demonstrated  in  Part  III  of  this  book  that  complex  binary  detection 
(i.  e.,  detection  without  recognition)  usually  is  of  less  interest  and  less  advisible 
than  simultaneous  signal  detection  and  recognition.  On  the  other  hand,  it  was 
shown  that,  in  the  majority  of  practical  cases,  energy  required  for  simultaneous 
detection  and  recognition  of  m  possible  signals  may  be  found  from  simple  binary 
detection  formulas  if  Pa  r  is  replaced  by  Pa  t/m  in  them.  Therefore,  for  brevity, 
only  simple  binary  detection  is  examined  below. 

• 

14.3  Binary  Detection 

Initially,  we  will  assume  that  the  signal  is  precisely  taown.  In  this  case, 
the  oscillation  at  receiver  input  has  the  form 


y(0  -«,(/) +  «,„(/). 


(14.5) 


When  there  is  no  signal 


«.(')- «».(')«  0; 

when  there  is  a  signal 

««  W  "«,,(/)- He  (/). 


(14.5a) 


i.  e. ,  from  the  point  of  view  of  message  reproduction,  in  this  case  message  x 
may  have  only  two  values: 


*  =  ,t9  =  0  and  x  —  xlt 


with  a  priori  probabilities 


0)  and  P  fo)  - />  (C) 


respectively. 


One  of  two  mutually-exclusive  responses  must  be  provided  as  a  result  of  observing 
realization  y(t)  [during  time  interval  (0,  T)]:  "yes"  (signal,  x  s  x^  or  "no" 

(no  signal,  x  =  xq). 

Each  possible  realization  y,  (/)  represents  one  point  in  multidimensional  space 
(see  §  6.3).  Therefore,  from  a  geometrical  point  of  view,  the  solution  to  the 


Figure  14.3 


detection  problem  requires  that  multidimensional  space  be  divided  into  ac(jacent, 
but  non-overlapping,  regions  A  and  B  (Figure  14.3).  If  "point"  y,(t)  falls  in 
region  A,  the  decision  is  "yes"  and  the  decision  is  "no"  if  it  does  not.  /243 

Two  types  of  errors  are  possible  when  solving  this  problem— false  alarms 
(with  probability  Pa  t)  and  signal  misses  (with  probability  P up  ). 

Evidently,  false  alarms  occur  if  realization  y(t)  falls  in  region  A  when 
there  is  no  signal. 

Therefore* 


PAT™$APu(y)dy-  (14.6) 

Signal  misses  occur  if  realization  y(t)  falls  in  region  B  when  there  is  a 
signal;  therefore 


Pup  =  (  P*>  (y)  dy.  (14.7) 

Correct  detection  results  if  realization  y(t)  falls  in  region  A  when  there 
ia  a  signal.  Consequently, 

pno~  i  — Pup ^  ^  pA,  (y)dy.  (14.8) 

It  follows  from  these  formulas  or  directly  from  Figure  14.3  that,  if  region 
A  equals  zero,  then  the  result  will  bet 


*Here  and  below,  Px(y )  are  multidimen8ional  distributions,  while  J  and  J 

A  B 

are  multidimensional  (m- dimensional)  integrals;  therefore,  an  expression,  for 


’/ 


instance  of  the  type  /  Pv(y)dy  is  an  abbreviation  of  the  integral 


/•:•/ 


Px(y1»  •  .  .»  y„>dyi  •  •  .  <*y 


n'  71 


352 


If  region  B  equals  zero,  then  it  will  be 


Pn  t-I.  =  0.  (14.10) 

Consequently,  it  is  possible  to  obtain  any  relationship  between  error  proba¬ 
bilities  Pn  t  and  P nr  (any  ratio  Pn  ilPnv  value  from  zero  to  infinity)  throucfi 
appropriate  selection  of  the  boundaries  between  regions  A  and  B. 

The  optimum  receiver  must  create  a  boundary  between  regions  A  and  B  that, 
in  a  certain  sense,  insures  a  relationship  between  error  probabilities  Pa  T  and 

P np  • 

It  follows  from  relationships  (14.9)  and  (14.10)  and  from  Figure  14.3  that, 
when  there  is  a  boundary  change  leading  to  a  decrease  in  probability  P„  ,  ,  there 
is  an  increase  in  probability  P„p  and  vice  versa.  Therefore,  it  is  impossible 
to  use  selection  of  region  boundaries  to  provide  a  simultaneous  minimum  for  /244 
both  Pn  t  and  P np  and  the  optimum  receiver  may  minimize  only  a  particular 
combination  of  probabilities  Pm  and  P np  .  In  accordance  with  this,  the 
following  binary  detection  optimizations  are  the  most  widespread: 

1.  Minimum  composite  error  probability  criterion  (ideal  observer  criterion): 

Pom  -  P  (*t)  Pnr  +  P  (*i)  Pm  -  min.  (14.11) 

2.  Minimum  average  risk  criterion: 

R  ~  oP  (*•)  Pit r  +  t>P  (x,)  P„ p  =  min,  (14.12) 

where  a  and  b  —  weight  factors  based  on  the  relative  danger  of  false  alarms  and 
signal  misses  (the  greeter  the  danger  of  false  alarms  compared  to  misses,  the 
greater  the  a/b  ratio  selected). 
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Magnitude  R  determined  from  relationship  (14.12)  is  called  the  average  risk 


3.  Minimum  weighted  probability  criterion: 


2  —  cPa,7  +  dP„p  =  min. 


(14.13) 


where  c  and  d  —  weight  factors  based  on  the  relative  danger  of  false  alarms  and 
signal  misses  (the  greater  the  danger  of  false  alarms  compared  to  misses,  the 
greater  the  c/d  ratio  selected). 


4.  Neyman-Pearson  criterion: 


where 


Pop 
P*  r 


—  min 
=  const. 


(14.14) 


If  a  priori  probabilities  P(x  )  and  P(x^)  of  signal  presence  and  absence 
are  known,  then  it  is  best  to  use  criterion  1  or  2  since  these  probabilities  are 
not  considered  in  criteria  3  and  4,  i.  e.,  all  known  a  priori  information  about 
the  signal  is  not  used  completely.  Here,  it  is  possible  to  use  criterion  1  in 
those  cases  when  false  alarms  and  misses  are  equally  dangerous,  while  criterion 
2  is  used  in  the  general  case. 

If  a  priori  probabilities  P(xQ)  and  P(x^)  are  unknown,  which  occurs  in  many 
cases  (in  radar,  for  example),  criteria  1  and  2  may  not  be  used.  Criteria  3  and 
4  have  to  be  used  in  this  event. 

In  the  event  criterion  3  is  used,  weight  factors  c  and  d  must  be  given,  because 
of  physical  formulation  of  the  problem  (here,  which  will  become  clear  later,  only 
their  ratio  c/d  plays  a  role),  with  false-alarm  probability  P„  T  given  when  criterion 
4  is  used.  In  a  majority  of  practical  cases,  it  is  simpler  to  use  the  corresponding 
selection  of  magnitude  P„  T  rather  than  ratio  c/d;  therefore,  the  Neyman-Pearson 
criterion  is  used  much  more  often  than  the  minimum  weighted  error  probability 
criterion. 

It  follows  from  comparison  of  relationships  (14.11)— (14.13)  that  the  /245 


first  two  criteria  from  a  mathematical  point  of  view  may  be  considered  particular 
cases  of  criterion  3:  criterion  1  is  obtained  from  (14.13)  if  one  assumes: 


c  —  P (.*„),  d-P(.t,), 

while  criterion  2,  if  one  assumes: 

c  —  aP  (.*„),  d  =--bP(x r,); 

therefore,  we  will  examine  criterion  (14.13)  in  more  detail. 


(14.15) 


(14.16) 


The  expression  (14.13)  minimum  is  insured  by  appropriate  region  A  and  B  selec¬ 
tion,  i.  e.,  by  using  magnitudes  Pa  t  and  Pnp  ,  given  constant  factors  c  and 
d.  Therefore,  the  magnitude  z  minimum  coincides  with  the  minimum  of  combination 


where 


*  ~~  Pl)P~t~  PP.l  T> 


(14.17) 


(14.18) 


From  (14.6),  14.8),  and  (14.17),  we  have 

'  *'  = 1  -I  IP A,  (y) — pPg, (ff)l dy-  (14-19) 

A 

Consequently,  the  optimum  receiver  must  provide  the  expression  (14.19)  minimum, 
i.  e. ,  the  maximum  of  magnitude 


2' "  [  (y)-Pp„  (y)l  dy.  (14.20) 

The  integrand  is  positive  for  some  realization  y(t)  values  and  negative  for 
others.  It  is  evident  that  region  A  must  include  all  values  of  y  for  which  the 
integrand  is  positive  and  exclude  every  y  value  where  the  integrand  is  negative 
in  order  to  obtain  the  greatest  integral  zH  value.  This  denotes  that  this  condition 
must  be  met  for  region  A 
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p*.  (y)-*P*.(y)>  f>. 


i.  e., 


p»,  (y) 

p«  to) 


>P. 


(14.21) 


while  this  inequality  corresponds  to  region  B 


(14.22) 

P*.  (y) 


But,  a  "yes"  response  corresponds  to  falling  into  region  A,  while  a  "no" 
corresponds  to  falling  into  region  B.  Consequently,  when  condition  (14.21)  is 
met,  the  decision  must  be  made  in  the  optimum  receiver  that  there  is  a  signal, 
while  the  decision  is  that  there  is  no  signal  if  this  condition  is  not  met.  /246 
In  other  words,  the  optimum  receiver  must  compute  mayiitude 


P«.  to) 


(14.23) 


and  compare  it  with  its  "threshold"  P  (figure  14.4) ;  if  it  turns  out  here  that 


Hy)>  P. 


(14.24) 


the  decision  is  that  there  is  a  signal;  otherwise,  the  signal  is  assumed  to  be 
absent. 
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Figure  14.4.  (a)  —  1 (y )  computer 


Magnitude  l(y)  determined  from  expre88ion  (14.23)  is  called  the  likelihood 
factor*  since  it  equals  the  ratio  of  likelihood  functions  P^(y)  an<*  PXQ(y)  and 
characterizes  the  likelihood  of  the  signal  present  hypothesis:  the  greater  l(y), 
the  more  likely  this  hypothesis. 


It  follows  from  the  above  examination  that  optimum  receiver  structure  is 
the  same  for  the  first  three  opt imizat ions ,  threshold  P-  values  being  the  only 
difference.  In  the  case  of  criterion  3  (minimum  weicftted  probability  criterion) 


in  the  case  of  the  minimum  composite  error  probability  criterion,  as  follows  from 
(14.15) 


P 


Pi*i)  * 


(14.25) 


while,  when  the  minimum  average  risk  criterion  is  used,  as  follows  from  (14.16), 


(14.26) 


"Sometimes,  it  also  is  called  the  likelihood  ratio. 


For  normal  white  noise,  from  formula  (4.10)  we  have 


P,Ay)~ 


\ 

U  2n/V )" 


|  T 

-  rri  <nr  it 

e  A*® 


and 


p*Ay)  (iW 


,  r 

-  ir.  .1 1  «»rw 
e  * 


Therefore,  in  accordance  with  (14.23)  and  (14.5a),  the  likelihood  factor  /247 
equals 


Q  i 

/(j/)=e“  777 


T 

(rH)"eum 


f 


(14.27 


where 

Q  =  j«e*(/)d/. 


It  follows  from  (14.24)  and  (14.27)  that  the  optimum  receiver  must  decide 
that  there  is  a  signal  if  this  condition  is  met 

i>U*  (14.28 


where 


5~;rJy(/K(0*. 


These  relationships  completely  coincide  with  relationships  (5.11a)— (5.11c) 
obtained  using  different  approaches  [the  minimum  composite  error  probability  cri¬ 
terion,  in  which  is  determined  from  formula  (14.25),  was  examined  in  Chapter 
5]. 


The  optimum  receiver  operating  principle  described  above  is  called  the  likelihood 
factor  test  since  the  decision  is  made  based  on  compering  likelihood  factor  l(y) 
with  some  threshold  ^9  [condition  (14.24)]. 

It  was  demonstrated  above  that  this  principle  supplies  the  optimum  solution 
to  the  problem  for  any  of  the  first  three  optimizations.  It  will  be  demonstrated 
below  that  the  same  principle  provides  an  optimum  solution  for  the  Neyman-Pearson 
criterion  as  well,  given  proper  threshold  j3  selection.  In  other  words,  optimum 
receiver  structure  turns  out  to  be  identical  for  all  four  optimizations  formulated 
above,  the  only  difference  being  selection  of  threshold  magnitude. 

So-called  performance  curves  depicted  (qualitatively)  in  Figure  14.5  are 
the  basic  optimum  receiver  (observer)  characteristics.  Each  performance  curve 


Figure  14.5 

gives  the  dependence  of  correct  detection  probability  Pn«  on  false-alarm  probability 
P »  Tfor  given  signal-to-noise  power  ratio  q  (here  and  in  future,  q  =  Q/Nq).  It 
is  not  difficult  to  become  convinced  that  performance  curves  actually  must  have 
(qualitatively)  such  a  form,  considering  the  following: 
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1.  It  follow 8  from  relationships  (14.9)  and  (14.10)  that,  in  any  detector 


Pao~0  ^ere  0; 

PB0-1  where  I. 

Coordinates  of  points  0  and  D  in  Figure  14.5  actually  satisfy  these  /248 
relationships. 

2.  For  given  false-alarm  probability  pa  » ,  the  greater  the  signal- to-noise 
ratio  q,  the  greater  the  correct  detection  probability  must  be.  Consequently, 
the  greater  parameter  q,  the  higher  the  location  of  the  corresponding  performance 
curve  must  be. 

3.  A  specific  threshold  value  corresponds  to  each  curve  point  M. 

Actually,  it  follows  from  relationship  (14.24)  that,  for  a  given  aignal-to- 
noise  ratio  q  and  P  ~*  00  ,  PM  T  =  0  and  0  =  0  must  be  the  case  (since 
the  probability  of  exceeding  an  inf initely-high  threshold  equals  zero,  both  when 
a  signal  is  absent  and  when  it  is  present).  Consequently,  P  “  00 •  corresponds 
to  point  0. 

If  —  0,  then  the  probability  of  exceeding  such  a  threshold  equals  unity 
when  a  signal  is  and  is  not  present.  Consequently,  where  P  *'  °,  Pn  «  «■  I 
and  P„  T  =  1  must  be  the  case,  i.  e. ,  P  —  0.  corresponds  to  point  D  (F igure 
14.5).  Thus,  when  threshold  J3  changes  from  infinity  to  zero,  point  M  shifts 
along  the  performance  curve  from  the  origin  to  point  D;  one  fully-determinate 
point  on  the  performance  curve  corresponds  to  each  J2  value  and,  on  the  other 
hand,  one  fully-determinate  threshold  value  corresponds  to  each  point  M  on 
this  curve. 

4.  Performance  curve  properties  noted  in  points  1—3  are  present  in  both 
optimum  and  non-optimum  receivers. 

If  the  receiver  is  optimum  (in  the  sense  of  any  of  the  four  optimizations 
enumerated),  then  this  relationship  also  is  satisfied  for  it 


t«1>=p. 


(14.30) 


where  \jf  —  slope  of  the  tangent  to  the  performance  curve  at  given  point  M  of 
this  curve  (Figure  14.5). 

Since  P  =»  0,  results  at  point  D,  then  ‘$=*0.  at  this  point,  i.  e. ,  the 
tangents  to  the  performance  curves  are  horizontal.  At  point  0,  where  p  **  oo, , 

the  result  is  if  =  ^  ,  i.  e. ,  the  tangents  to  the  performance  curves  are  vertical. 
Hence,  it  follows  that,  in  optimum  receivers,  performance  curves  in  their  initial 
sector  (at  point  0)  have  vertical  tangents,  while  the  tangents  are  horizontal 
at  the  final  sector  (at  point  D). 

Relationship  (14.30)  validity  is  demonstrated  in  the  following  manner. 


For  given  signal- to- noise  ratio  value  q,  the  performance  curve  under  study 
provides  dependence  Pm  •  "  f(Pa  r)  for  an  optimum  receiver.  Here,  its  value 
corresponds  to  each  of  this  curve's  points,  for  instance  at  point  M  /249 

P  =*P- 

In  order  to  find  what  tg  \jf  equals,  there  is  a  requirement,  retaining  the 
receiver  as  optimum,  to  provide  an  inf initely-small  increment  d to  threshold 
J3  .  It  follows  from  Figure  14.5  that  a  threshold  J3  change  causes  a  change 
in  probabilities  Pn  o  and  Pa  t ,  i.  e., 

pno«/i(P)  and  Pflr=/,(p),  (14.32) 
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therefore 


and,  at  point  M(fi,)  • 


(14.33) 


On  the  other  hand,  it  follows  from  (14.6),  (14.8),  and  (14.20)  that  an  optimum 
receiver  provides  the  maximum  of  magnitude 

0Pj«t; 


therefore,  at  point  M,  where  fl  =*  p„  the  maximum  of  the  following  magnitude 
is  provided 


•'-/’..-XP.,  (14.34) 

It  follows  from  (14.32)  and  (14.34)  that,  when  0  changes,  function 

*'<P)=/i(P)-PJ,(W  (W*35) 

i  consequently,  this  condition  must  be 


must  have  a  maximum  at  point 
met 
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Considering  this  relationship,  from  (14.33)  we  obtsin 


Consequently,  for  s  receiver  optimum  in  the  sense  of  sny  of  the  first  three 
criteria  [i.  e.,  which  provide  the  expression  (14.17)  minimum  or  the  expression 
(14.34)  maximum],  the  performance  curves  actially  satisfy  relationship  (14.30). 

Now,  we  will  demonstrate  that  a  receiver  optimum  in  the  sense  of  any  of  the 
first  three  criteria  turns  out  optimum  also  in  the  sense  of  the  Neyman-Pearson 
criterion,  given  proper  threshold  J3  selection. 

This  requires  demonstrating  that,  for  a  given  P„  T  magnitude,  a  receiver  /250 
providing  the  magnitude  z"  maximum  [formula  (14.34)]  also  provides  the  probability 
Pa  e  maximum,  given  proper  J3  selection. 

The  performance  curve  of  a  receiver  (optimum  in  the  sense  of  the  first  three 
criteria)  for  some  signal-to-noise  ratio  q^  is  depicted  in  Figure  14.6.  Let  the 


Figure  14.6 


false-alarm  probability  be  given 

P*r  **  »• 

It  follows  from  Figure  14.6  that  fully-determinate  point  M  on  the  performance 
curve  corresponds  to  given  q^  and  n  and,  consequently,  fully-determinate  threshold 
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J3  and  correct  detection  probability  values: 


P  =a  P»'  o  of* 


(14.36) 


We  will  show  that,  for  given  values  q  =  and  Pn  T  =  Pn  n  »  selection  of 
a  receiver  having  such  a  performance  curve  and  such  a  threshold  P  —  P«,  , 

insures  that  the  probability  Pn  o  maximum  is  obtained,  i.  e. ,  satisfies  the  Neyman- 
P ear son  criterion. 

Each  receiver  type  and  each  output  threshold  value  (for  a  given  receiver 
type)  has  its  own  region  A  shape  (i.  e. ,  the  region  used  as  the  basis  for  the 
signal/no  signal  decision).  Selection  of  region  A  determines  probability  P no 
and  P*  T  magnitudes  (for  a  given  q). 

Consequently , 


P„o  =  Pn*lMfin 

P„~PaM  01. 


(14.37) 


where  notations  pn  «  [A  (0)|  and  Pn  r  [A  (P)J  denote  that  pno  and  Pxy  will  depend 
on  region  A,  which  in  turn  will  depend  on  threshold  JJ  . 

Let  region  A^(  J3  Q)  correspond  to  the  optimum  receiver  and  to  threshold 
J3  q  and  region  A ( J3  )  to  a  receiver  operating  on  another  principle  (or  having 
another  threshold  value) . 

Since  given  false-alarm  probability  Pn  n  must  be  insured  in  all  cases,  then 


P»i\Ai  (P«)l  =  p«*i 

PnAMP)\=Pnrl- 


(14.38) 


Consequently,  it  remains  to  show  that,  for  P  —  P»  and  region  A  of  the  A^(  JJ  q) 
type,  probability  Pn  n  will  be  greater  than  for  any  other  region  A  type,  i.  e., 


that 


(14.39) 


It  follows  from  (14.34)  that  region  4,  (ft,)  corresponds  to  a  receiver  /251 
insuring  the  maximum  of  magnitude 

*  *^110 — Pi  P»  t* 


It  6  •  f 

p„  „ \At  (P,)l  -p.  P,  T  M,  (P,)]  >  P„  .  [A  (P)|  -p,  p»  T  l  A  (PH. 


(14.40) 


But,  it  follows  from  (14.38)  that 

therefore,  (14.40)  coincides  with  (14.39),  which  must  be  demonstrated. 

Consequently,  a  receiver  optimum  in  the  sense  of  the  first  three  criteria 
also  is  optimum  in  the  sense  of  the  Ney man-Pea rson  criterion  if  threshold  J2 
is  selected  with  respect  to  p*  rand  q^,  as  shown  in  Figure  14.6. 

Thus,  for  any  of  the  aforementioned  optimizations,  the  receiver  has  the  identical 
structure  insuring  computation  of  likelihood  factor  l(y)  and  its  comparison  with 
threshold  J3  (Figure  14.4).  Only  threshold  J3  magnitude  will  depend  on  the  opti~ 
mization  accepted. 

- 

Threshold  J3  for  criteria  1,  2,  and  3  is  determined  from  formulas  (14.25), 
(14.26),  and  (14.18),  respectively,  and  will  not  depend  on  the  signal- to-noise 
ratio.  In  the  case  of  the  Neyman- Pea rson  criterion,  J2  is  determined  from  performance 
curves  (Figure  14.5)  for  given  P«  »  and  q  values  and,  consequently,  will  depend 
on  the  signal-to-noise  ratio. 

i 
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14.4  Detection  Characteristics.  Threshold  Signal 

Detection  characteristics  provide  the  dependence  of  incorrect  (or  correct) 
detection  probability  on  the  signal-to-noise  ratio  and  make  it  possible  to  determine 
the  threshold  signal  (signal-to-noise  ratio  threshold  value),  i.  e. ,  the  minimum 
signal  insuring  given  detection  quality.  Detection  characteristics  may  be  plotted 
from  performance  curves  and  their  form  will  depend  on  the  optimization  selected. 

In  the  case  of  the  minimum  composite  probability  criterion  (ideal  observer 
criterion),  composite  error  probability  Poa  determined  from  expression  (14.11) 
characterizes  detection  quality.  Therefore,  the  following  dependence  is  called 
the  detection  characteristic 

Pom  **  ft  (fl¬ 
it  may  be  plotted  from  performance  curves  (Figure  14.3)  in  the  following  manner: 

1.  We  determine  threshold  J3  from  formula  (14.25). 

2.  Using  the  graphical  approach,  we  will  find  the  points  (one  each  per  curve) 
on  the  Figure  14.5  performance  curves  at  which  the  slope  of  the  curve  equals  J3  , 
i.  e.  for  which 

3.  For  every  point  found,  i.  e.,  for  each  value  of  q,  we  determine  /252 
magnitudes  P «  ,  and  Pnp  and  we  compute  the  corresponding  composite  probability 
Ppm  values  from  formula  (14.11). 

4.  We  plot  the  dependence  of  P «■  on  q  from  the  values  found. 

Thus,  for  example,  when  P  s  1,  the  detection  characteristic  of  a  fluctuating 
signal  has  the  form  depicted  in  Figure  9.8. 


In  the  case  of  criteria  2  and  3,  the  following  dependencies,  respectively, 
are  called  detection  characteristics 

R-=ft(q)  and  /,(<?). 

The  methodology  for  plotting  them  is  analogous  to  that  described  above,  with 
the  exception  that  threshold  J3  is  determined  accordingly  from  formula  (14.26) 
or  (14.18),  while  magnitudes  R  and  z  are  computed  from  formulas  (14.12)  and  (14.13) 

In  the  case  of  the  Neyman-Pearson  criterion,  this  dependence  is  called  the 
detection  characteristic 

pno ~ft(4)  where  PflT=  const. 

It  will  be  plotted  from  performance  characteristics  (Figure  14.5)  in  the 
following  manner: 

1.  A  vertical  line  is  drawn  through  point  (  Pn  T  ,  0) ,  where  p«  »  —  given 
false-alarm  probability,  and  the  points  where  it  intersects  the  performance  curves 
will  be  found. 

A  P n«  magnitude  will  be  found  for  each  of  these  points,  namely  for  every 
q  value. 

2.  Dependence  Pn«  s  f^(q)  will  be  plotted  from  the  values  found. 

Such  a  characteristic  computed  from  the  Figure  9.6  curves  for  p*  »  =0.2 
is  depicted  in  Figure  14.7. 

Signal-to- noise  ratio  fan*  insuring  the  given  correct  detection  probability 
is  determined  from  the  detection  characteristic  found. 

Fully-determinate  point  M  in  a  family  of  performance  curves  (Figure  14.5) 
corresponds  to  the  value  and  given  probability  P„  T  found.  The  performance 

curve  slope  at  this  point  determines  the  requisite  threshold  Q  magnitude: 


14.5  Comparison  of  the  Statistical  Hypothesis  Testing  Approach  to  the  Inverse 
Probability  Approach 


In  accordance  with  the  inverse  probability  approach  presented  in  Parts  II 
and  III,  the  signal/no  signal  decision  is  based  on  comparison  of  inverse  probabilities 
Py(xx)  and  Py(xQ)  of  signal  presence  and  absence,  respectively.  If  the  /253 
maximum  inverse  probability  criterion  is  used,  then  the  decision  that  there  is 
a  signal  is  made  if 

P„(xl)>Pv(x<t),  (14.41) 

and  that  there  is  no  signal  in  the  opposite  case.  Here,  minimum  composite  error 
probability  Pnm  is  insured. 

It  also  was  pointed  out  that,  in  those  cases  when  the  danger  of  false  alarms 
and  signal  misses  is  not  identical,  the  decision  that  there  is  a  signal  must  be 
made  when  this  condition  is  met 

P»  0*i)  >  ^Py  (•*#)*  (14.42) 

where  ^  —  some  weight  factor  selected  that  is  greater,  the  greater  the  danger 
of  false  alarms  compared  to  signal  misses. 

It  is  not  difficult  to  become  convinced  that  these  criteria  completely  coincide 
with  the  minimum  composite  error  probability  and  minimum  average  risk  criteria 
presented  in  this  chapter. 

Actually,  considering  that 


Pv(Xt)  =  kP{Xt)Px,  (i/)  | 


and 


P„  (jc„)  =  *P(xa)  Px,(y), 


(14.43) 


condition  (14.42)  may  be  written  in  the  form 


where 


/  <*/)  >  P. 


t  (y)  = 


fit  ■  <y) 

fir.  (y) 


ii 


p  =  ij 


p  (*«) 

P(r,)  ‘ 


(14.44) 


These  relationships  completely  coincide  with  expressions  (14.23),  (14.24), 
and  (14.26)  if  you  assume 

n-J-*  (14.45) 

Consequently,  criterion  (14.42),  which  may  be  called  a  weighted  maximum  inverse 
probability  criterion,  completely  coincides  with  the  minimum  average  risk  criterion, 
i.  e.,  insures  minimum  average  risk. 

If  you  assume 


tl-l.  (14.46) 

then  condition  (14.42)  corresponds  to  the  maximum  inverse  probability  criterion 
and,  in  formulas  (14.44),  the  following  should  be  assumed 


8  —  JLilsl . 

K  P  (*i) 

These  results  completely  coincide  with  the  expressions  obtained  from  the 
minimum  composite  error  probability  criterion  (ideal  observer  criterion).  /254 
Consequently,  the  maximum  inverse  probability  criterion  coincides  with  the  minimum 
composite  error  probability  criterion  (ideal  observer  criterion). 

Statistical  criteria  3  and  4  are  used,  as  indicated  in  §  14.3,  in  those  cases 
when  a  priori  probabilities  P(xQ)  and  P(x^)  are  unknown  and,  consequently,  inverse 
probabilities  Py(x^)  and  Py(xQ)  also  are  unknown  [see  formulas  (14.43)].  Therefore, 
criteria  3  and  4  cannot  be  equated  with  criteria  based  on  comparison  of  inverse 
probabilities.  However,  as  was  demonstrated  in  §  14.3,  the  structure  of  the  optimum 
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receiver  for  criteria  3  and  4  is  identical  to  that  used  with  criteria  1  and  2— -the 
only  difference  is  threshold  J3  magnitude.  Therefore,  during  optimum  detector 
design  and  investigation  of  its  properties,  it  is  relatively  immaterial  which 
criterion  is  used. 

The  following  two  criteria  are  the  most  widespread  due  to  their  simplicity 
and  clarity: 

1.  Minimum  composite  error  probability  criterion  (ideal  observer  criterion, 
maximum  inverse  probability  criterion). 

2.  Neyman-Pearson  criterion. 

The  first  criterion  is  used  more  often  in  communications  systems,  while  the 
second  more  often  in  radar  (since  a  priori  probabilities  P(x^)  and  P(xq)  in  many 
cases  are  knowns  in  communications  systems,  while  as  a  rule  they  are  unknowns 
in  radar). 

14.6  Computation  of  the  Likelihood  Factor  for  a  Random-Parameter  Signal 

It  was  demonstrated  in  §  14.3  that,  in  the  case  of  a  precisely -known  signal, 
the  optimum  receiver  (detector)  must  compute  likelihood  factor  l(y)  and  compare 
it  with  threshold  .  Here,  the  likelihood  factor  is  determined  from  expression 
(14.23).  We  now  will  explain  the  special  features  arising  if  a  signal  has  parasitic 
random  parameters,  i.  e. , 

y  (/)=«*(«, . a„\  t)+um  (/). 

Here,  in  accordance  with  (8.11)  and  (8.12),  we  have  (assuming  that  message 
x  has  no  statistical  link  with  the  parasitic  parameters): 


PJxt)~k  f  ...  J  P{xt)P(* . . 

A*t  *a„ 

X  P i,,a,  ...  ,an  ( y )  ^®l  ■  ■  • 

Pv(x0)  =  kP(x9)  Px,  (y). 


(14.47) 
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Therefore,  condition  (14.41),  corresponding  to  the  maximum  composite  error 
probability  criterion  (or  the  maximum  inverse  probability  criterion),  may  be  written 
in  the  following  form: 


/(!/)>  fl, 


(14.48) 


where 


/255 


•  ••  I  /M<* . « 


( IJ )  dat  . .  ita„ 


l  («/)  = 


Px.  (//) 


(14.49) 


(14.50) 


Comparing  expressions  (14.48)~(14.50)  with  the  corresponding  expressions 
for  a  precisely-known  signal,  i.  e.,  with  formulas  (14.23)— (14.25),  it  is  not 
difficult  to  become  convinced  that  they  coincide,  with  the  exception  that,  when 

computing  likelihood  factor  l(y),  the  value  of  likelihood  function  P*,>a, . «„(!/), 

averaged  considering  a  priori  distribution  P  («i . «■)  of  all  parasitic  signal 

parameters,  will  be  placed  in  the  numerator. 


It  is  possible  also  to  write  expression  (14.49)  in  the  form 
/(»)=  J  ...  T  . . . «,) /«„(y)da1..don, 


where 


^*1.*), ....  «„(>) 
'  P**<y) 


(14.51) 


(14.52) 


is  the  likelihood  factor  computed  in  the  assumption  that  all  parasitic  parameters 
of  the  signal  are  known,  i.  e.,  that  the  signal  is  precisely  known. 


It  follows  from  relationships  (14.51)  and  (14.52)  that  likelihood  factor 
l(y)  corresponding  to  a  signal  with  parasitic  parameters  may  be  obtained  through 
statistical  averaging  of  likelihood  factor  (found  in  the  assumption  that 

the  signal  is  precisely  known)  relative  to  all  parasitic  parameters  a, . a„  . 
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Formulas  (14.48)— -(14.52)  were  derived  above  relative  to  the  minimum  composite 
error  probability  criterion.  All  these  formulas  remain  valid  for  the  remaining 
criteria,  as  in  the  case  of  a  precisely-known  signal,  with  the  exception  that 
threshold  J3  must  be  determined  from  formulas  corresponding  to  these  criteria 
[formulas  (14.18),  (14.26),  and  so  on],  rather  than  from  formula  (14.50). 

Optimum  detector  structure  and  error  probabilities  P«  r.  P up  and  Pom  for 
various  signal  types  may  be  found  on  the  basis  of  relationships  (14.18) — (14.50). 

However,  as  was  demonstrated  in  §  14.5,  an  approach  based  upon  computation 
of  likelihood  factor  l(y)  and  its  comparison  with  threshold  ,  fully  eqietes 
to  the  approach  based  upon  computation  and  comparison  of  inverse  probabilities 
Py(x1)  and  Py(xQ)  presented  in  Part  III  of  the  book.  Therefore,  use  of  the  likelihood 
factor  approach  to  analysis  of  specific  cases,  already  examined  in  Part  III  (detection 
of  a  random-phase,  fluctuating,  and  other  signal  types),  would  only  be  a  repeat 
of  these  results  and  is  senseless. 

Mention  only  should  be  made  that  some  mathematical  computations  may  be  /256 
simplified  (compared  to  those  presented  in  Part  III  of  the  book)  if  the  property 
of  the  optimum  detector  performance  curves  characterized  by  relationship  (14.30) 
is  used. 


It  follows  from  relationships  (14.30)  and  (14.31)  that,  for  the  optimum  detector 


dPa  o  (P)  _  a 
dP* r(fi)  P' 


(14.53) 


where  designations  P n  o  (ft)  and  Pn  t  (P)  indicate  that  correct  detection  probability 
P n a  and  false-alarm  probability  P> i  »  here  are  considered  a  threshold  J3 
function  (Figure  14.5). 

It  follows  from  formula  (14.53)  and  Figure  14.5  that 


!i 


(14.54) 


[in  the  case  examined,  integration  occurs  with  respect  to  curve  QM  (Figure  14.5) 


and  variable  jj  changes  from  infinity  (at  point  0)  to  value  jj  ,  corresponding 
to  point  M  at  which  magnitude  Pn  o  (P)  is  determined]. 

Formula  (14.54)  makes  it  possible  to  simplify  computation  of  correct  detection 
probability  Pa  o  significantly  since  false-alarm  probability  Pn  T  included  in 
it  is  determined  when  only  noise  (without  signal)  is  active,  while  determination 
of  magnitude  P«  o  without  using  expression  (14.54)  requires  examination  of  a  case 
where  signal-plus-noise  is  active. 

We  will  examine  detection  of  a  random-phase  signal  to  illustrate  these  postu¬ 
lations.  The  following  results  were  obtained  for  this  case  in  Chapter  9: 

(9.34) 


where 


Ini.  <*)-£+  In 


P  (»)_ . 
P  <«•)  ' 


(9.35) 


here  P(0)  and  P(aQ)  —  a  priori  probabilities  of  signal  absence  and  presence, 

it  © •  f 


P(0)*P(jf0)  and  P  (a0)  =  p  (Xl). 

Since  formula  (9.35)  was  obtained  from  the  maximum  inverse  probability  criterion 
(or  the  minimum  composite  error  probability,  which  is  the  same  thing),  then  rela¬ 
tionship  (14.25)  is  valid,  i.  e., 

£J2L  _  £i£»> «  b. 

P  (a.)  P  <*i> 


Therefore,  it  is  possible  to  write  formula  (9.35)  in  the  following  form:  /257 


where 


(14.55) 
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It  follows  from  these  relationships  that  a  one-to-one  link  exists  between 
z  and  J3  i  therefore,  formula  (9.34),  in  essence,  expresses  dependence  P* 
of  interest  to  us  and  expression  (14.54)  may  be  written  in  the  following  form: 

P„  0  =  (14*56) 

ec 

But,  if  follows  from  (9.34)  that 

dP»t _ Hi. 

dt  2 Q, 


therefore,  from  (14.56)  we  have 


Pn o  -  I  - e 1, (x) dx. 


(14.57) 


This  expression  fully  coincides  with  formula  (9.36)  obtained  in  Chapter  9  by  another 
more-unwieldy  approach. 


CHAPTER  FIFTEEN  /257 

SEQUENTIAL  DETECTION 
15.1  General  Comments 

The  theory  of  optimum  reception  methods  presented  above  is  based  on  the  fact 
that  signal  (or  message)  detection  or  reproduction  must  occur  during  predetermined 
observation  time  T.  However,  in  many  cases,  better  results  may  be  obtained  if 
the  observation  duration  is  not  predetermined  (i.  e.,  prior  to  beginning  the 
observation)  and  the  problem  of  when  observation  should  cease  is  solved  during 
the  observation  process  itself,  depending  on  results  obtained. 

Actually,  in  some  sequences,  noise  oscillation  realizations  may  turn  out 
to  be  so  favorable  that  reliable  detection  of  a  signal  and  reproduction  of  its 
parameters  may  occur  much  faster  than  in  other  sequences,  when  noise  realizations 
are  less  favorable.  Therefore,  if  observation  duration  T  is  not  predetermined, 
it  is  possible  to  obtain  a  significant  savings  in  observation  time  on  the  average 
(for  many  sequences). 

A  case  where  observation  duration  T  is  not  predetermined,  but  is  determined  /258 
by  the  progress  of  the  experiment  itself,  is  called  sequential  observation  (sequential 
analysis).  As  opposed  to  this,  observation  with  a  predetermined  duration  is  called 
simple  or  classic  observation. 
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It  follows  from  the  aforementioned  that  duration  T  of  the  time  of  observation 
during  sequential  observation  is  a  random  magnitude  changing  from  one  sequence 
to  another. 

In  those  sequences  in  which  noise  realizations  turn  out  to  be  favorable, 
resultant  observation  time  is  less  than  its  mean  value  T. 

On  the  other  hand,  time  T  may  turn  out  to  be  significantly  greater  than  T 
when  noise  realizations  are  unfavorable. 

The  main  advantage  of  sequential  observation  over  simple  (classic)  observation 
is  the  decrease  in  observation  time  mean  value  T  (it  is  evident  that  T  =  T  during 
simple  observation).  Its  main  drawback  is  the  random  nature  of  the  observation 
time  and  the  attendant  possibility  of  situations  where  T  turns  out  to  be  much 
greater  than  T. 

Wald  for  the  first  time  did  a  mathematical  study  of  the  sequential  observation 
(sequential  analysis)  process  and  published  an  this  subject  in  1947  [29].  In 
subsequent  years,  Middleton,  Busgang,  Blasbalg,  Bashirinov,  Fleyshman,  and  others 
successfully  used  the  Wald  vehicle  to  solve  the  problem  of  optimum  sequential 
detection  of  a  signal  on  a  noise  background  [16,  78,  85,  86,  and  121].  A  brief 
rundown  on  the  basic  results  obtained  in  these  works  is  presented  in  this  chapter. 

15.2  Sequential  Detector  Operating  Principle 

We  will  examine  sequential  binary  detection  of  a  signal.  As  a  result  of 
observation,  the  decision  must  be  "yes"  (signal)  or  "no"  (no  signal,  only  noise). 

The  observation  process  is  divided  into  a  series  of  sequential  intervals 
(steps)  of  sufficiently-short  duration  At  (Figure  15.1).  The  "observer"  (receiver) 
in  the  first  observation  step  has  signa l-plus-noise  realization  j^tt)  at  its 
disposal  (Figure  15.2a).  The  likelihood  factor  is  computed  from  formula  (14.23) 
based  on  this  realization 
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Figure  15.1 


Figure  15.2 


If  a  "yes"  or  "no"  decision  must  be  made  by  the  end  of  interval  At,  then 
the  likelihood  factor  should  be  compared  with  some  threshold  J3  (as  is  the  case 
in  a  simple  analysis)  and,  if  this  threshold  is  overrun,  the  response  is  "yes." 
Otherwise,  the  response  is  "no."  However,  during  sequential  detection,  the  /259 
time  the  experiment  ends  (i.  e. ,  the  moment  the  decision  is  made)  is  not  prede¬ 
termined;  therefore,  three,  rather  than  two,  different  answers  are  possible  at 
the  end  of  interval  At: 

1.  "Yes"  (signal). 

2.  "No"  (no  signal) 

3.  It  is  impossible  as  yet  to  decide  ("yes"  or  "no")  with  sufficient  reliability 
and,  consequently,  the  observation  must  be  extended,  i.  e. ,  move  to  the  next  obser¬ 
vation  step. 

Therefore,  two  thresholds  .  and  '>  ,  rather  than  one, 

are  established  during  sequential  detection. 


If  it  turns  out  that 


the  decision  is  "yes"  (signal)  and,  consequently,  observation  ceases;  if 


ItUfuXfii.  (15.2b) 

then  the  decision  is  "no"  (no  signal)  and,  consequently,  observation  also  ceases. 
Finally,  if  it  turns  out  that 

0t</i(ito)<fc.  (15-2c) 

then  no  decision  is  made  and  observation  continues,  i.  e. ,  a  transition  is  made 
to  the  second  step. 

In  the  second  step,  the  "observer"  (receiver)  already  has  at  its  disposal 
realization  yoj£(t),  which  corresponds  to  interval  2At  (Figure  15.2b)  and  the 
likelihood  factor  is  computed  for  it: 


(15.3a) 


Magnitude  l(y012)  again  is  compared  with  thresholds  J3 ^  and  J3  ^  and  the 
decision  is  "yes"  if 


(y*i«)  >  Pt* 


or  is  "no"  if 


/260 


(Vm)  <  Pi- 


If  it  turns  out  that 


Pi  <  /j  (Vm)  <  Pt« 


(15.3b) 


then  no  decision  is  made  and  there  is  a  transition  to  the  next,  third,  observation 
step. 

Next,  the  process  continues  in  an  analogous  manner  until,  at  some  step  n 
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likelihood  factor  1  (yni  )  finally  turns  out  not  to  be  greater  than  u  , 

or  less  than  and,  consequently,  the  appropriate  decision  will  be  made  ("yes" 
or  "no"). 

It  may  seem  at  first  glance  that,  in  several  situations,  sequential  observation 
may  last  an  infinitely-long  time.  However,  as  Wald  demonstrated,  the  probability 
that  the  test  will  end  (during  a  finite  time)  equals  unity  for  a  very  broad  class 
of  realization  y^  distributions. 

Step  n  in  which  observation  (analysis)  ceases  is  called  a  finite  step.  Evi¬ 
dently,  n  is  a  random  magnitude  changing  from  one  sequence  (concluded  observation) 
to  another.  Accordingly,  also  random  is  composite  observation  time 

r=»«A/.  (15.4) 

It  follows  from  what  has  been  stated  that,  during  sequential  observation, 
the  space  of  all  possible  realizations  y(t)  is  divided  into  three  adjacent,  but 


Figure  15.3 


non-overlapping,  regions  A,  B,  and  C  (Figure  15.3).  If,  at  the  i-th  step,  realization 

y01  i(0  falls  into  regions  A  or  B,  then  the  appropriate  "y*s"  or  "no"  decision 

is  made  and,  consequently,  observation  ceases.  If  this  realization  y,^  #1(t) 

corresponds  to  intermediate  region  C,  then  no  decision  is  made  and 
observation  continues. 

As  was  the  case  in  simple  observation,  two  types  of  erroneous  decisions  are 
possible— false  alarms  ("yes"  when  there  is  no  signal)  and  signal  misses  ("no" 


when  there  is  a  signal),  with  probabilities  P.i T  and  P»p  ,  respectively.  Regions 
A,  B,  and  C  in  the  optimum  receiver  must  be  divided  so  that  the  best,  in  a  certain 
sense,  solution  to  the  detection  problem  is  insured. 

It  is  possible  to  characterize  sequential  detection  quality  by  means  of  a 
linear  combination  of  detection  error  probability  and  mean  observation  time,  having 
the  form  /261 

R  =  c,P(0)  P)t1+ctP  (QPnp + ct  T. 

where  (15.5) 

T  *P(0)T15j  +  P(C)fl50. 

Here,  as  was  the  case  previously,  P(0)  and  P(C)  —  a  priori  probabilities 
of  signal  absence  and  presence,  respectively;  T  —  mean  (unconditional)  observation 
time  value;  T(0)  —  mean  observation  time  given  that  there  is  no  signal  (i.  e., 

mean  duration  of  those  sequences  in  which  there  is  no  signal);  T(a^)  —  mean  obser¬ 
vation  time  given  a  signal  with  amplitude  a^  (i.  e.,  mean  duration  of  those  sequences 
in  which  there  is  a  signal  with  amplitude  a^);  c^,  and  c^  —  some  weight  factors 

considering  the  relative  danger  of  false  alarms,  signal  misses,  and  greater  mean 
observation  duration. 

Evidently,  the  lower  the  R  value,  the  better  the  receiver. 

A  type  (15.5)  criterion  differs  from  corresponding  simple  (not  sequential) 
detection  optimizations  [see  (14.12),  for  example]  only  in  that,  here,  receiver 
quality,  along  with  error  probabilities,  is  characterized  also  by  mean  observation 
duration  T. 

Wald  and  Wolfowitz  demonstrated  that,  regardless  of  given  error  probabilities 
Pa  t  and  Pup  ,  weights  c^,  C2,  and  c-j,  and  a  priori  probabilities  P(0)  and 

P(C),  no  observation  method  provides  mean  observation  duration  values  T(0)  and 

T(a^)  lower  than  does  observation  based  on  the  aforementioned  sequential  computation 
of  likelihood  factors  U  G/ot).  U  (jfai)  and  so  forth  and  their  comparison  with 

thresholds  J3  ^  and 
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must  be  determined  from  formulas 


(these  formulas  are  valid  when  /’np  (fli)  <  0,5  and  Pn  t<0,5  ,  which  usually  is 

the  case). 


Sequential  detection  determined  in  this  manner  is  called  optimum  sequential 
detection. 


It  follows  from  what  has  been  said  such  detection  insures  receipt  of  minimum 

mean  durations  T(0)  and  T(a^)  and,  consequently,  minimum  mean  observation  time 
T  as  well  for  any  error  probability  Pn  *  and  Pap  values,  any  a  priori  probability 
P(0)  and  P(C)  values,  and  any  weight  factors  c^,  c and  Cy 


It  should  be  noted  that  Wald  obtained  formulas  (15.6)  with  a  simplified  assump¬ 
tion  that,  at  the  end  of  a  test,  likelihood  factor  ln(yQl  n)  turns  out  precisely 
to  equal  threshold  J3  2  or  ft e*»  so  there  is  no  so-called  "excursion 


outside  the  boundaries." 


If  At  — *  0,  then  this  assumption  is  completely  valid.  Actually,  if  the 

test  ends  in  the  n-th  step,  then  this  signifies  that,  in  the  (n— l)-th  step,  /262 

it  still  has  not  concluded  and,  consequently,  in  the  (n— >l)-th  step,  magnitude 

1  ,  (y_,  ,)  still  has  not  exceeded  the  boundaries  set  by  thresholds 

n—  1  ui.  .  . n— j. 

ft  x  and  2. 

On  the  other  hand,  the  test  is  over  in  the  n-th  step  and,  consequently,  in 

this  step  magnitude  1  (ym  )  must  exceed  one  of  the  boundaries  determined 

n  ux«  •  *n 

by  thresholds  ft  ^  and  ft  y 

But,  where  At — ►O,  the  likelihood  factor  in  the  n-th  step  is  identical  to 
that  in  the  (n— l)-th  step.  Therefore,  if  it  does  not  overrun  the  boundaries 
in  the  (n— l)-step,  then  it  may  not  overrun  either  boundary  in  the  n-th  step 
either— in  this  final  step,  it  only  coincides  with  one  of  these  boundaries. 


If  interval  At  is  finite,  then  the  result  in  the  n-th  step  may  be  .  /„  >  P* 

or  /„  <  Pi,  ,  i.  e. ,  a  transition  outside  the  boundaries  may  occur.  However, 
as  demonstrated  in  several  works,  where  n  >  1  (which  usually  is  the  case),  formulas 
(15.6)  and  the  approximate  relationships  Wald  obtained  on  their  basis  essentially 
are  accurate  enough. 

Observation  step  duration  At  selection  is  based  on  the  following  circumstances. 

Let  At^  be  that  minimum  interval  between  adjacent  realization  y(t)  sample 
values  in  which  these  values  may  be  considered  statistically  independent.  Then, 
if  At  >  At ^  is  the  selection,  mean  .observation  time  T  increases  when  interval 
At  increases.  Actually,  in  this  case,  a  At  increase  considerably  increases  infor¬ 
mation  about  observed  process  y(t)  and,  in  3ome  cases,  a  suf ficiently-reliable 
decision  may  be  obtained  also  with  less  information,  i.  e.,  even  prior  to  conclusion 
of  a  given  interval  At.  Therefore,  in  several  cases,  an  interval  At  decrease 
makes  it  possible  to  speed  up  the  decision-making  process  and,  consequently,  must 
lead  to  a  decrease  in  mean  observation  duration  T.  Consequently,  it  is  not  a 
good  idea  to  select  an  interval  At  magnitude  exceeding  Atk» 

If  At<^  At^  is  selected,  then  a  further  interval  At  decrease  does  not  provide 
a  significant  mean  observation  duration  decrease.  Actually,  in  this  case,  all 
process  y(t)  sample  values  within  interval  At  have  such  a  strong  mutual  statistical 
link  that  each  ordinate  y(t^)  taken  within  this  interval  essentially  completely 
characterizes  entire  realization  y(t)  in  this  interval.  Therefore,  further  breakdown 
of  this  interval  At  into  a  series  of  smaller  intervals  may  not  provide  any  significant 
improvement  in  detection  quality  and  (in  the  case  of  a  discrete  sample)  will  lead 
only  to  excess  system  complication  (due  to  the  increase  in  the  number  of  observation 
steps) . 

In  light  of  the  aforementioned  circumstances,  the  usual  selection  is  /263 

(15.7) 

Here,  for  the  purposes  of  computation  and  design  simplification,  systems 
usually  are  limited  to  consideration  within  each  interval  At  only  of  one  ordinate 
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y^,  i.  e. ,  replacement  of  the  analog  sample  (Figure  15.1)  by  a  discrete  sample 
(Figure  15.4). 

If  interval  At  here  is  selected  in  accordance  with  relationship  (15.7),  then 
this  replacement  of  an  analog  sample  by  a  discrete  sample,  on  the  one  hand,  does 
not  provide  a  significant  deterioration  in  detection  quality  while,  on  the  other 
hand,  it  makes  it  possible  to  consider  all  sample  values  y^,  y2,  .  .  .,  yR  sta¬ 
tistically  independent,  which  will  lead  to  very-significant  simplification  of 
the  analysis. 

15.3  Fundamental  Relationships  During  Optimum  Sequential  Detection 

From  the  point  of  view  of  mathematical  statistics,  the  relationships  presented 
below  correspond  to  the  following  problem  formulation. 

Random  sample  values  y^,  y2,  .  .  .,  yi  are  observed  sequentially.  The  multi¬ 
dimensional  law  of  distribution  of  these  values  is  fully  known  (for  any  number 
i  of  these  values),  except  for  single  unknown  parameter  6  and  is  designated 

Pn (y)  - JF  (ft. ft. ... . ft;  0).  (15.8) 

Based  on  observation,  the  hypothesis  that  P  <  must  be  tested  relative 

to  alternative  (incompatible)  hypothesis  that  (where 
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In  order  for  this  problem  formulation  to  correspond  to  the  signal  detection 
problem,  it  evidently  is  necessary  to  meet  the  following  conditions: 

1.  The  multidimensional  law  of  distribution  of  signal-plus-noise  y(t)  must 
have  only  one  unknown  parameter  0  . 

2.  Parameter  0  must  be  such  that  case  0  ^  unequivocally  means 

no  signal,  while  condition  0  >  0,,  also  unequivocally  must  correspond  to 
signal  presence. 

It  is  not  difficult  to  become  convinced  that  the  second  condition  will  be 
satisfied  if  Q  represents  signal  "strength"  (amplitude,  amplitude  mean  statistical 
value,  energy,  energy  mean  value,  signal-to-noise  power  ratio,  and  so  on)  deteimined 
so  that  it  eqials  zero  when  there  is  no  signal  (0  =  0o  —  0),  and  it  is  no 

less  than  some  predetermined  magnitude  (  e  >  0i  )  when  there  is  a  signal. 

We  will  examine  several  examples. 

1.  Signal  a  cos  («/  +  <p)  .  is  precisely  known  except  for  amplitude  a, 

which  may  have  only  two  values:  a  s  0  (no  signal)  and  a  =  a^  (signal). 

In  this  case,  signal-plus-noise  /264 

lf<0-«e(0  +  «m(0 

has  only  one  unknown  parameter-signal  amplitude*  and,  if  one  selects 


0«a;  0,  =  ot, 


(15.9) 


then  the  second  condition  also  will  be  met. 


2.  Signal  a  cos  (mt  +  <p)  is  known,  with  the  exception  of  amplitude  a 


♦Here  and  henceforth,  the  assumption  everywhere  is  that  the  noise  law  of  dis¬ 
tribution  is  completely  known. 


and  phase  cp  ,  while  phase  Cf)  is  random  and  equally  probable  in  the  0  to  2  7T 
range. 

In  this  case,  it  is  possible  to  sequentially  analyze  the  oscillation  at  the 
output  of  the  amplitude  detector  connected  beyond  the  optimum  linear  filter,  rather 
than  at  receiver  input.  Here,  detector  output  voltage  values  are  used  as  sample 
values  y^,  y,,  .  .  . ,  yi  and  amplitude  a  is  the  only  unknown  parameter  in  the 
distribution  of  these  values. 

If  the  signal,  when  present,  has  the  only  possible  known  amplitude  a^  value, 
then  it  is  possible  to  assume 


0  =*a;  0|B*d|.  (15.10) 

3.  If,  in  the  preceding  two  examples,  incoming  signal  amplitude  a  is  in  known, 
some  value  of  this  amplitude,  for  which  miss  probability  fa)  must  be  given, 
is  given  prior  to  the  observation,  with  the  assumption  being 

QsA;  0|=±d}.  (15.11) 

Composite  sequential  detection  stemming  from  this  assumption  will  be  optimum, 

i.  e.,  will  provide  minimum  mean  observation  durations  T(0)  and  T(a),  if  the  amplitude 
of  the  signal  arriving  at  receiver  input  actually  does  equal  selected  magnitude 
a^.  If  not,  receiver  action  then  will  not  be  optimum.  Theref r usually  the 
minimum  signal  amplitude  value  at  which  the  given  detection  v  must  be  provided 

is  selected  as  a^.  This  then  guarantees  that  receiver  action  ...  e  optimum 
for  the  minimun  (reliably -observable)  signal. 

Receiver  action  still  will  not  be  optimum  when  signal  amplitude  is  increased 
beyond  value  a^,  but  this  is  not  so  dangerous  since  a  signal  strength  increase 
will  lead  to  a  signal  miss  probability  decrease,  even  if  some  optimality  is  lacking 
in  receiver  action. 


4.  If  what  is  present  is  a  noise-type  signal  with  a  completely-known  law 
of  distribution,  then  the  only  unknown  parameter  of  the  distribution  of  /265 


sample  values  (y^,  .  .  . f  y .  )  is  extant  signal  voltage  value  and  it  is  possible 
to  assume: 


e-£/*;  0,-1/,*.  (15.12) 

where 

Yu^. 

If  magnitude  U ^  is  not  known  beforehand,  then,  stemming  from  the  same  circumstances 
as  in  the  preceding  example,  the  minimum  extant  signal  value  at  which  given  error 
probabilities  pnr  and  Pa  r  must  be  supplied  are  selected  as 

It  is  clear  from  what  has  been  presented  that,  in  sequential  detection,  6 
is  the  parameter  characterizing  signal  strength.  Here,  9*0  corresponds  to 
no  signal,  while  0  =  0,  corresponds  to  presence  of  a  signal  with  the  anti¬ 
cipated  strength. 

In  the  general  case,  strength  &  of  an  actually -arriving  signal  is  unknown; 
therefore,  6 ^  is  called  anticipated  strength  value,  i.  e.,  that  value  determining 
optimal  detection.  As  opposed  to  this,  6  denotes  that  signal  strength  value 
which  actually  occurs  during  a  given  detection  sequence  and  it  may  equal  either 
zero,  or  6 or  (in  the  general  case)  any  other  value. 


Prior  to  the  start  of  the  observation,  detection  error  probabilities 
P»  »  and  P gp  (9j)  are  given.  Here,  the  symbol  P« p  (®i)  signifies  that  the  given 
miss  probability  must  be  insured  in  those  cases  when  signal  strength  equals  the 
anticipated  magnitude,  i.  e.,  6^. 

Thresholds  ft  ^  and  ^  determined  in  accordance  with  formulas  (15.6)  are 
selected  from  given  Pa  »  and  Pa p  (9,)  magnitudes,  namely 


l  — Pbp(Qi)  ^ 

Pjit 


(15.13) 


Here,  P„< 0,5  and  ^(01X0,5 


*w  *.  *. 


The  following  are  basic  detector  (receiver)  characteristics  determining  the 
quality  of  its  work  and  subject  to  computation: 


f(5j®A/.n(0)f 


(15.14) 


where  T(  Q  )  and  n(  Q  )  —  mean  (statistical)  values  of  observation  time  T  and 
sample  size  n,  determined  for  the  condition  that  input  signal  strength  equals 

e . 

Since,  in  accordance  with  (15.14),  a  very-simple  one-to-one  link  exists  between 
T(  Q  )  and  n(  Q  ),  for  brevity  in  future  we  will  examine  only  mean  sample  size 
n(  6  )• 

Particular  but  important  function  n(  Q  )  values  are  n(0)  and  n(  B  j),  /266 
i.  e. ,  mean  sample  values  for  signal  absence  and  presence  of  a  signal  of  anticipated 
strength  0^  ,  respectively. 

A  typical  n(  Q  )  characteristic  is  depicted  in  Figure  15.5.  It  has  a  maximum 


Figure  15.5 


Figure  15.6 


at  some  critical  signal  strength  value  8„p  ,  while 


o<e«<e,. 


I 


The  following  explains  the  presence  of  this  maximum.  Given  sufficiently-high 
signal  strength  (Q  6}),  the  test  ends  relatively  quickly  on  the  average  since, 
here,  the  probability  is  high  that  the  total  output  effect  of  upper  threshold 
ft  2  will  be  exceeded. 

When  there  is  no  signal  (  6  =0),  the  test  also  ceases  relatively  quickly 
on  the  average  since,  here,  the  probability  is  relatively  hic^i  that  the  effect 
of  noise  alone  will  be  less  than  lower  threshold  ft  y 

If  there  is  a  signal  (  $  t  0)  but  its  strength  is  relatively  low 
( 6  <  0 ,  then  the  oscillation  at  receiver  output  [l(y)  computer]  will  be 

found  a  majority  of  the  time  between  thresholds  ft  ^  and  ft 2  and  a  relatively-long 
time  will  pass  until  this  oscillation  turns  out  to  be  below  threshold  ft ^  or 
above  threshold  ft  y 

2.  Dependence  L(f9).  called  the  function  operational  characteristic.  Here, 

L( @ )  is  the  probability  of  acceptance  when  the  test  of  no-signal  hypothesis  Hq 
ceases,  given  that  input  signal  strength  equals  6  .  Since,  following  test 
cessation,  it  is  mandatory  to  accept  hypothesis  Hq  or  H^,  then  1— L(0)  is  the 
probability  of  accepting  (following  test  cessation)  signal  hypothesis  H^,  also 
determined  given  that  signal  strength  equals  0  . 

It  follows  from  these  determinations  that 


(15.15) 


Typical  characteristic  L(  Q)  is  depicted  in  Figure  15.6.  Since,  for  /267 
6  i  0,  magnitude  L(@)  is  nothing  but  the  probability  of  missing  a  signal 
with  strength  6  ,  then  it  is  natural  that  L(  0 )  monotonically  decreases  when 
d  increases. 


It  follows  from  the  aforementioned  that  the  detector  provides  optimum  results 

only  when  0  s  0  and  ^  s  &  3,  [provides  minimum  n(0)  and  n(  6  ^)  values  for 
Pi,  t  and  P#p(0,)  ].  However,  it  is  evident  from  the  Figure  15.5  and  15.6  curves 


ii 


that,  when  e  >  e  p  detection  quality  monotonically  improves  with  a  rise  in 
6  (mean  observation  time  and  signal  miss  probability  decrease).  Therefore, 
if,  as  demonstrated  above,  6  ^  corresponds  to  the  minimum  signal  strength  value, 
an  excursion  from  the  optimum  detection  mode  occurring  when  0  >  6  ^  is  not 

dangerous. 

Since  sample  size  n(  &  )  is  a  random  value  changing  from  one  sequence  to 

another,  it  is  important  to  know  not  only  mean  sample  size  value  n(Q)  but  the 
magnitude  n($)  law  of  distribution  as  well.  However,  to  date,  there  has  been 
no  success  in  finding  sufficiently-general  relationships  for  computation  of  this 
law  of  distribution  and  most  of  the  time,  if  not  always,  there  is  success  in  computing 
only  variances  <rjj(0)  and  <*»<•, »  of  magnitudes  n(0)  and  n(  d  . 

A  brief  description  of  the  methodology  for  computing  characteristics 
n(0) t  L(  f  ),  and  dispersions  »*<#)  and  <*«(•,)  ,  when  sample  values  y.,  y2,  .  .  ., 
y^  are  statistically  independent,  follows. 

Function  L( 0)  is  determined  from  the  formula 


(15.16) 


where  h  —  supplemental  parameter  determined  from  functional  equation 


(15.17) 


Here,  W(y^,  6  )  —  unidimensional  probability  density  of  sample  value  yi  when 
sicpal  strength  equals  Q  .  Therefore,  W(y^,  0  )  and  W(y^,  6  —  unidimensional 

sample  y distributions  when  there  is  no  signal  (0:0)  and  when  a  signal  with 
strength  6 ^  is  present,  respectively. 

Mean  sample  size  n(0)  is  determined  from  the  formula 


L  (0)  lnP,  +  |I— t  Infl, 


V 


V  (y,.  «> 

*W  =  I  2(yiW(y„V)dyt, 


(15.19) 


(15.20a) 


1.  e., 


It  follows  from  (15.15)  and  (15.18)  that 


(15.20b) 


TT/KT  (1—  Par)  In  Pi  4- Par  In  P» 

"(0) - m 


(15.21) 


i  _  Pnp (0i)  l»Pff[l~Pnp(9i)l  iw  Pa 


(15.22) 


It  f ollows  from  (15.13),  (15.21),  and  (15.22)  that,  when 
Pnp(01) ^  0,1  ,  it  is  possible  to  assume 


and 


n  (0)  *  ■ 


Pgyfii) 

-rm 


(15.23) 


_ U  10  o~ 

n  (0,)  * - . 

1  1  -«15J) 


The  following  approximate  formulas  [16  and  85]  are  valid  for  magnitudes  n(0) 
and  n(  0^)  of  variances  o,*,0,  and  ,  given  slight  P„T  and  Pnp(0,)  : 


”T7W 


(15.24) 


(15.25) 


mr* 


where 


o|«e,  =lz  («))*- [*  (0)1*; 

OD 

17WlJ  =  5  2a  (1/1)  W'  (y,.  0)  rft/r. 

— «e 

oj(0)  and  o^e,,  —  variance  <**<«»  values  when  R  =  0,  and  0=®i  f 

respectively. 

It  is  possible  to  use  formulas  (15.16)— (15.26)  to  compute  the  main  charac¬ 
teristics  of  the  optimum  sequential  detector. 

It  follows  from  (15.16)  that,  when  h  s  0,  magnitude  L(  0  )  will  become  in¬ 
determinate.  Magnitude  n(0),  computed  from  formula  (15.18)  accordingly  /269 
also  turns  out  to  be  indeterminate.  The  following  relationships  are  obtained 
after  these  ambiguities  are  expanded: 


(15.26) 

(15.27) 


and 


(15.28) 


f7flpr  _  In  fa  In  P«  f  dh  1 

l  (15.29) 

where  Q*  is  value  6  at  which  h  s  0. 

The  next  section  will  present  an  example  illustrating  use  of  the  above-derived 
formulas  and  providing  a  representation  of  sequential  detector  quality. 

15.4  Sequential  Detection  of  a  Fluctuating  Pulse  Signal 

Let  the  signal  comprise  a  train  (packet)  of  independently-fluctuating  pulses 
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J3B88  (along  with  noise)  throuch  apt  optimum  linear  filter  matched  with  the 
»  1,  e. ,  having  bandwidth 


77»-L, 

H 


(15.30) 


where  —  deration  of  each  pulse. 


An  inertia-free  linear  detector,  which  separates  the  signal-plus-noise  (or 
noise  only)  envelope,  is  osnnaeted  at  filter  output.  The  distribution  over  time 
of  the  anticipated  signal  pulses  is  known  at  the  point  of  reception  and  the  task 
is  to  determine  whether  or  not  such  a  signal  is  present  at  receiver  input. 

Sequential  analysis  of  voltage  y(t)  St  detector  output  is  used  to  solve  this 
problem.  Samples  yJ#  y2,  •  •  «*  yA  bf  this  voltage  are  token  at  moments  tj,  t^, 

•  •  •*  t^,  corresponding  i a  the  aoments  the  anticipated  Signal  pulses  cease  at 


(as  a  signal  pulse  appear^  at  optimum  linear  filter  input,  signal 
amplitude  at  filter  output  reaches  its  maximum  at  the  mbtfent  the  input  pulse  ceases). 

Consequently,  sample  interval  At  equals 


where  T»  —  signal  pules  spacing. 

.  \  ■  ^  V 

Sines  Tm>  V  avid  filter  band  ft  satisfies  relationship  (15.30),  it  is 
possible  to  consider  that,  givsn  such  an  interval  At  selection,  sampling  values 
yi  mutually  are  statistically  Independent,  both  for  noise  alone  (when  there  is 
no  signal)  and  when  s  signal  is  present. 

When  there  is  no  signal  and  only  normal  noise  is  active  at  detector  /270 
input,  the  distribution  of  yi  voltage  values  at  linear  detector  output  is  stAordinate 
to  Rayleigh's  lost 


W  to'  =  ufci e  “ where  y,  >  0, 


W  (j/i*  0)  =  0 


where  Vy  <  0, 


(15.31) 


where  Um  —  extant  noise  voltage  value  at  detector  input  (i.  e.,  at  optimum  filter 
output) . 


When  the  aforementioned  signal  is  present,  the  distribution  takes  the  form 


where  ft  >0-,  | 

(  (15.32) 

yt  <  0,  ) 


W(y„  p*)  =0 


where 


where,  in  accordance  with  (9.77)  and  (9.78) 


1  ! 
'  Um* 


1+^5- 


(15.33) 


Qep/n  —  energy  of  one  packet  pulse;  Qcp  *”*  energy  of  the  entire  packet  (of  n 
pulses) . 


We  will  designate: 


Qep 

'ivi: 


(15.34) 


then,  9  =  0,.  when  there  is  no  signal,  while  when  there  is  a  signal  with  anticipated 
average  energy  Qcpt 


0-0, 


(15.35) 


Therefore,  it  is  possible  to  write  distribution  (15.32)  in  the  following  form: 


W(y„  0) 


jt 


tV  <l+») 


rw, 


»i» 


where  y y  ^  0, 


(15.36) 


W(yt,  0)  —  0 


where  yi  <  0. 


Substituting  expressions  (15.31)  and  (15.36)  into  formula  (15.19),  we  have 


i+0i  d+e,)2 um* 

It  follows  from  (15.20a)  and  (15.37)  that 

( )_  l+a,  +u+el)2t/mt* 


where 


'  *+,>  =  2U*(l  +°): 


therefore 


2  (Brin¬ 


'd +9). 


1+9,  1+0, 

Further,  from  (15.27),  (15.36),  and  (15.37),  we  will  find  that 

UtejF-fln  —ir\*+2(ln  -i— ) — -5* - ^s+F - ®i - 

V  i+o* /  \  i+oJti+eoa/m*1"  ^Lc+eoayn,*]  ’ 


where 


|  y,\W  (y„  9)  dy,  -  2  (t  +  0)*  4 (/**. 

Substituting  the  expressions  for  y?  and  yt 4  into  (15.39),  we  obi 


5W-l'nrfir),+2(,"Hir)“ 


+*pss?t- 


From  (15.38)  and  (15.41)  we  will  find 


-PW-U  (0)1*  - • 


Hence,  it  follows  that 


arc 


(15.43) 


From  (15.21),  (15.22),  and  (15.38)  we  have 


—isr  U— -Par)  Ingj+Pat 
n(U)  — - - 

- -In  (1+0,) 


n  (0,) 


I  +8, 

Pap  (®i)  In  gi  +  n  -  Pnp  C0»)1 1"  Pt 
■  6j  —  ln(I+0t) 


(15.44) 


From  formulas  (15.24)  and  (15.25)  we  will  find  [considering  relatidnships 
(15.38)  and  (15.43)]  that 


I 

li+e,  1 

|*  1"  Pi 

r  ®»  ln 

(«+e,)j3 

U+®t 

.  St^nP. 

let-in  (l+e,)J»* 


(15.45) 


It  is  not  difficult  using  formulas  (15.44),  (15.45),  and  (15.13)  to  /272 

compute  the  average  sizes  of  samples  n(0)  and  n(^)  and  their  variances  o*(0) 
and  0*  (B,)  for  given  values  6  ^  pap  (9i)  and  pn  ?  . 

For  example,  let  0 ^  =  1;  Pnp(0,)  =  0.1  and  P„  T  s  10“^.  Then,  from  the 

aforementioned  formulas,  we  will  obtain 

M0)~U,5;  JM0l7“2O-  J  (15.46) 

o<i<o>=*8,5;  o'ims,)  —  16.  ) 

Here 


^2>-0,74  dnc^-0,8. 

n(  0)  «(»>) 


394 


ii tit 


It  is  significantly  more  difficult  to  compute  characteristics  L( 0 )  and 

r»(0)  for  random  signal  strength  values  0  (and  not  only  for  0  0  and 

0  -  0,). 

To  do  so  requires  that  supplemental  parameter  h  s  h(  6  )  first  be  found  from 
functional  equation  (15.17). 

In  the  case  under  examination  where  distribution  W(y^,  6  )  is  described  by 
expression  (15.36) ,  equation  (15.17)  takes  the  form 


—Mil 

(1+9,)  VJ, 


■  J 


-»l* 

.(7+375(7 


0+0) 


in 


dyt  =  1. 


After  transformation  [85],  this  equation  takes  the  following  form: 

(I  +0J)*-AO1  (1 4-8,)*-*  (1  +0)  - 1.  (15.47) 


It  is  transcendental  relative  to  h.  Therefore,  one  must  find  the  dependence 
of  h  on  6  by  solving  equation  (15.47)  for  0  ,  rather  than  h: 


u+eo-d+oo1-* 

0,  h 


(15.48) 


e , 


It  is  not  difficult  using  this  formula  for  a  given 
of  B  on  h.  This  dependence  is  depicted  in  Figure  15.7  for 


to  plot  the  dependence 
s  1.  Here, 


Figure  15.7 


Figure  15.8 
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the  following  corresponds  to  vslue  h:Q 

1 

j 

|  0=0' =0,4.  (15.49) 

It  is  possible  to  use  the  Figure  15.7  curve  to  compute  dependence  L(@) 
determined  from  formula  (15.16).  Here,  at  the  ambiguity  point  (  &  s  6'  , 
j  h  s  0),  formula  (15.28)  rather  than  formula  (15.16)  should  be  used  for  the  com¬ 

putation. 

Characteristic  L(@)  depicted  in  Figure  15.8  and  corresponding  to 
|  ^np^i)  s  0.1,  P»  T  s  10‘3,  and  6^  =  1  is  computed  in  the  same  way. 

At  points  0  s  0  and  0  =  0,  =  1  this  characteristic,  as  would  be  expected, 
provides  these  initial  magnitudes: 

£(0)  =i_pj|i  »o,999, 

/273 

Dependence  n(@)  is  computed  from  formula  (15.18).  Here,  values  L($)  are 

taken  from  the  Figure  15.8  graph,  while  magnitude  z(Q)  is  computed  from  formula 
(15.38). 

At  the  ambiguity  point  (  6  =  6  ,  h  =  0),  magnitude  n( O' )  should  be  computed 
from  formula  (15.29). 

In  the  case  under  examination,  from  (15.38)  we  have 

r*33E>1  -  9t 

L  <»«  \m-  >+•»  *  (15.50) 


From  (15.48)  we  will  find 


(d9\  (l+et)lln(H-et)]«  . 

V  dh  h,m  o  20, 
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therefore 


/  dft  \  _ _ 20i _ 

\  dQ  ]e-e-  (l+e,)lln(H-e,)l*  ’ 

Substituting  (13.50)  and  (13.51)  into  (15.29),  we  obtain 


(15.51) 


n(0') 


'«»•'«  x 


(15.52) 


P*  t 


In  the  particular  case  being  examined,  when  0.  = 

_3  1 

=  10  ,  this  formula  provides 


1, 


Pop  (0i)  =  0.1,  and 


nfSo-Sl. 


Figure  15.9 

Figure  15.9  depicts  characteristic  n (0)  plotted  in  this  manner.  Where  /274 

6  =  0  and  0  =  0  it  has  the  values  n(0)  =  11.5  and  n(0^)  =  20  already 
found  above. 

15.5  Comparison  of  Sequential  Detection  to  Classic  Detection 

Now,  we  will  compare  results  obtained  during  sequential  detection  to  corres¬ 
ponding  results  from  classic  binary  detection. 


It  is  possible  to  characterize  the  gain  obtained  during  sequential  detection 
by  the  ratios: 


where 


rt-P(O)n(O)  +  P(Qft(0,). 


(15.53) 


Here,  n  —  sample  size  during  classic  binary  detection  with  identical  signal 
strength  values  0  ^  and  error  probabilities  P«  *  and  Pap  . 

The  Figure  12.9  curves  from  classic  binary  detection  are  valid  in  this  case 
of  detection  of  a  train  of  independently-fluctuating  signal  pulses.  In  this  figure, 
ratio  Qep/N,  ,  where  <?«,  —  total  energy  of  n  pulses,  is  plotted  on  the  Y-axis. 
Therefore,  in  accordance  with  (15.34),  we  have 


while,  for  anticipated  signal  strength  6 ^ 

^2-nO,.  (15.54) 

We  will  do  a  comparison  for  the  following  values  of  magnitudes  6  P*  t, 
and  P an  (®i) : 


01  —  1;  P,,- 10-*;  Pap(0»)-0,1» 


i  •  6#  $ 


Pn*-1-Pnp(°«)-0’9- 


Here,  formula  (15.54)  provides 


and,  from  the  Figure  12.9  curve  corresponding  to  Pn  t  -  10“*  and  Pn  » 
we  will  find  that 


0,9 


> 


a  —35. 


(15.55) 


Relationships  (15.46)  were  obtained  in  §15.4  for  sequential  analysis  /275 
of  the  same  Pn  »  and  Pnp  (0i)  values.  Therefore,  from  (15.46),  (15.53),  and  (15.55), 
we  have 


n 

it 


_ 35 _ 

ll.5P(0)  +  20(l-P(0)| 


=  1.75; 
t  ,75 

I  — 0.42P  (0)  ‘ 


It  follows  from  the  last  relationship  that  gain  n/n  in  an  average  (unconditional) 
sample  size  will  depend  on  a  priori  signal  absence  probability  P(0)  and  increases 
when  this  probability  rises.  The  following  result  when  probability  P(0)  equals 
0,  0.5,  and  1,  respectively: 


~  =  1,75;  -2- -2.2  and  ~  =  3,05. 
n  n  n 


Hence,  it  follows  that,  in  the  case  under  examination,  the  gain  in  an  average 
sample  size  (i.  e.,  in  mean  observation  duration  T)  is  1.75  4-  3.05  [depending 
on  a  priori  probability  P(0)].  Here,  the  greatest  gain  magnitude  (3.05)  corresponds 
to  value  P(0)  s  1  or  [where  P(0)  i  1]  to  cases  when  there  is  no  signal. 


As  computations  demonstrate  [85],  given  a  decrease  in  permissible  false-alarm 

probability  and  all  other  things  being  equal,  gain  n/n(0)  monotonically  increases 

and  may  significantly  exceed  unity.  For  example,  where  6 ,  =  1,  ^np  (8,)  s  0.9, 

*  * 


and  Put  -  10  .  the  result  is 


n 

*W) 


-5,2. 


The  aforementioned  examples  corresponded  to  a  case  where  Pnr  'ZPop  (Bt)  .  If, 
in  accordance  with  problem  conditions, 


—  Pa,  (®i). 


\ 


Y 


k- 


then,  as  computations  for  several  signal  types  made  by  different  authors  shoe, 
the  result  is 


n  _  H  r 

n(0j  n  (6,) 


Since  the  result  here  is 

n(0)»n  (0,j, 


then,  from  (15.53),  we  obtain 

n  —  n(0)  [P  (0)  -f  P  (C)  1  -MO) 


and,  consequently. 


—  as  2. 


n 


It  follows  from  these  data  that,  when  Pn  r  *  Pn9  (0,),  sequential  analysis  /276 
provides  a  gain  in  mean  observation  duration  by  approximately  a  factor  of  2. 

« 

If  Pn  ,  Pn p(®i)»  then  the  gain  may  be  significantly  greater,  especially  if  signal 

absence  probability  P(0)  is  sufficiently  high  or  if  the  gsin  is  assessed  only 

by  ratio  n/n(0),  i.  e. ,  only  for  a  case  of  no  signal.  Therefore,  it  is  most  advisible 
to  use  sequential  analysis  when  problem  conditions  stipulste  that  Pn  ,  €  Pn,  (6.) 
and  when  the  a  priori  signal  absence  probability  is  significantly  greater  than 
the  a  priori  signal  present  probability,  i.  e.,  P(0)  >  P(C).  This  situation  arises 
often  in  radar. 


However,  along  with  the  aforementioned  advantage,  sequential  analysis  has 
major  drawbacks  compared  to  classic  analysis. 


The  first  drawback  is  the  randomness  of  sample  size  n,  i.  e. ,  of  observation 
duration  T. 

In  the  example  examined  in  §  15.4  [see  relationships  (15.46) ],  sample  size 
standard  deviation  <*n  from  its  mean  value  is  74 — 80%.  Computations  made  for 
several  signal  types  by  different  authors  demonstrate  that  standard  deviation 
°n  from  the  average  sample  size  is  50—80%  for  other  typical  signals  as  well. 
This  signifies  that,  in  a  rather-signif icant  percent  of  sequences,  observation 
time  may  exceed  its  mean  value  somewhat. 

So-called  »lit  sequential  analysis  is  used  to  ameliorate  this  shortcoming. 
Here,  a  predetermined  maximum  observation  time  value  T-m  is  established  and 
analysis  mandatorily  must  cease  when  this  time  runs  out  (if  it  has  not  already 
ended). 

As  long  as  observation  duration  does  not  exceed  7'lnm>  i.  e. ,  the  number 
of  steps  i  does  not  exceed  magnitude 


sequential  observation  will  be  conducted  with  respect  to  the  steps  in  the  normal 
manner,  i.  e. ,  with  two  thresholds  J3 ^  and  ft 2’  In  3^;eP  ^ana  »  only  one  threshold 

instead  of  two  thresholds  J3 ^  and  /Ij  is  established  (as  in  classic  binary 
detection).  Therefore,  if  observation  has  not  yet  concluded  prior  to  transition 
to  this  step,  then  it  will  conclude  mandatorily  at  this  step  by  making  the  decision 
"no"  (if  it  turns  out  that  )  or  the  decision  "yes"  (if  Jiumm > P«>. 

Splitting  the  observation  into  steps  *imm  rules  out  the  possibility  of  very-long 
observation  durations  T  significantly  exceeding  T.  However,  the  lower  the  rmm 
selected,  i.  e.,  the  more  significant  the  splitting,  the  less  the  gain  T/T  in 
mean  observation  time  obtained  during  sequential  analysis.  This  is  explained 
by  the  fact  that,  when  ratio  TMM/T  decreases,  sequential  analysis  becomes  /277 
more  like  classic  analysis. 


The  second  shortcoming  of  sequential  analysis  compared  to  classic  analysis 


is  its  great  complexity.  Sequential  analysis  may  be  simplified  considerably  by 
preliminary  conversion  of  analog  sampling  values  y.  into  quantum  binary  magnitudes 
— ones  and  zeros. 

This  simplified  method  is  examined  in  the  following  section. 

15.6  Sequential  Analysis  of  Quantum  Samples 

In  the  sequential  analysis  approach  under  examination,  sample  values  y^, 
y2,  .  .  . ,  y^  beforehand  are  converted  into  ones  and  zeros  for  some  quantization 


Figure  15.10 


threshold  yQ  (Figure  15.10):  if  y^  >  yQ,  then  a  one  is  produced  at  quantizer 
output;  otherwise,  a  zero  is  produced  (where  <  yQ) . 

Thus,  ones  and  zeros,  which  are  subjected  to  sequential  analysis,  are  formed 
at  quantizer  output.  Interval  At  is  such  that  all  observation  results  (i.  e. , 
all  ones  and  zeros)  among  themselves  are  statistically  independent. 

It  usually  is  assumed  for  the  purposes  of  further  simplification  that  usable 
information  (i.  e.,  information  as  to  whether  a  signal  is  present  or  not)  will 
be  contained  exclusively  in  the  quantity  of  ones  (or  zeros)  in  a  given  observation 


I 

interval  (iAt),  i.  e. ,  disregarding  the  influence  of  the  position  of  the  ones 
I  over  time  within  this  interval. 

The  probability  that  there  will  be  k^  ones  and,  consequently,  (i«_k^)  zeros 
in  interval  iAt  is  determined  by  known  binomial  distribution 


W0(klt 


lJ*i  *,l  (/-*,)! 

(A|=0.  1,  2. 


pS*(i 

...  ,  l), 


(15.56) 


where 


P9~P*{y,>y  •) 


is  the  probability  that  y^  will  exceed  yQ,  i.  e.,  the  probability  of  the  appearance 
of  ones  at  quantizer  output. 

Index  6  in  magnitudes  1 F«(*i.y*)  and  Pg  underscores  the  circumstance  /278 
that  these  magnitudes  will  depend  on  signal  strength  6  •  Therefore,  instead 
of  (15.56),  it  is  possible  to  write 


'’S' (l 


(15.57a) 

(15.57b) 


where  P«,  and  PQ  —  probabilities  of  the  appearance  of  ones  given  presence  of 
a  signal  with  anticipated  strength  and  given  absence  of  a  signal  (  6  =  0), 
respectively,  while  ^e,(Ai.  y«)  and  W9(kt,y9)  —  binomial  distributions  for  presence 
of  a  signal  of  intensity  0 ^  and  when  there  is  no  signal  (  0=0),  respectively. 


It  follows  from  (15.57)  that  the  logarithm  of  the  likelihood  factor  in  this 
case  equals  [also  see  (15.19)] 


*iu  y*)  -  in 


(*«.?>) 


S*  (i— 


i 


In  accordance  with  the  rules  of  sequential  analysis,  magnitude  z(i,  yQ)  in 
each  step  i  must  be  compared  with  thresholds  and  0*.  determined  from  formulas 
(15.13) j 

if  2  (i,  y0)  <  Pi  ,  then  hypothesis  Hq  (no  signal)  is  accepted; 

(15.59) 

if  2  (<’.  yo)>Pt,  then  hypothesis  (signal  with  strength  0^) 
is  accepted; 

finally,  if  it  turns  out  that 

(15.60) 

then  the  test  continues,  i.  e.,  there  is  a  transition  to  the  next  observation 
step. 


It  is  simple  to  obtain  the  following  decision  areas  based  on  relationships 
(15.50)— (15.60)  j 

if  k,  <^7^t  ,  then  hypothesis  H  is  accepted; 

if  kt  >  ,  then  hypothesis  H.  is  accepted;  (15.61) 

I  ^ 

if  2ii  .  then  the  test  (observation)  continues. 


I 
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i 

I 

! 

I 

i 

1 

i 

I 
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It  is  possible  to  draw  the  following  conclusions  from  examination  of  rela¬ 
tionships  (15.61)  and  (15.62): 

1.  A  sequential  detector  completely  is  determined  by  three  parameters  a, 

b,  and  c,  which  are  functions  of  four  parameters:  Pi,  P»,  PQ,  and  . 

2.  Parameters  a,  b,  and  c  will  depend,  among  other  things,  on  quantization 
threshold  yQ  (since  probabilities  and  PQ  will  depend  on  this  threshold). 
Therefore,  an  appropriate  (optimum)  quantization  threshold  must  be  selected  to 
insure  better  sequential  detector  operation. 

3.  Since  parameters  a,  b,  and  c  remain  unchanged  during  the  sequential  analysis 
process,  then  this  analysis  will  boil  down  to  computation  of  the  number  of  ones 

at  quantizer  output  and  comparison  of  this  number  with  thresholds  in  accordance 
with  very-simple  relationships  (15.61). 

It  follows  from  what  has  been  stated  that  the  sequential  detector  being  examined 
is  comparatively  simple  in  design.  Its  basic  elements  are  a  quantizer,  which 
converts  observed  signal-plus-noise  y(t)  into  a  sequence  of  ones  and  zeros,  and 
a  binary  scaler. 

Introduction  of  quantization  will  lead  to  some  loss  in  usable  information 
(see  Figure  15.10).  Therefore,  sequential  analysis  of  quantized  samples  in  principle 
must  provide  worse  results  than  optimum  sequential  analysis  of  non -quantized  sample 
values  described  in  preceding  sections.  However,  as  theoretical  and  experimental 
analyses  done  by  several  authors  [16,  85,  and  121]  have  shown,  the  resultant  loss 
in  typical  cases  is  relatively  slight.  Thus,  for  example,  during  detection  of 
a  sinusoidal  carrier  (and  optimum  selection  of  quantization  threshold  yQ),  intro¬ 
duction  of  quantization  will  lead  to  a  mean  observation  time  increase  by  a  factor 
of  approximately  1.5  [16]. 

Therefore,  in  a  number  of  cases,  significant  detector  design  simplification, 
which  accompanies  introduction  of  quantization,  completely  may  compensate  for 
the  resultant  slight  deterioration  in  detection  quality. 
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CHAPTER  SIXTEEN  /Z80 

ANALOG  MESSAGE  RECEPTION  ANALYSIS  USING  THE  DISTRIBUTION  PARAMETER  ESTIMATE  APPROACH 
16.1  Problem  Formulation  in  the  Classic  Theory  of  Statistical  Estimates 

Initially,  we  will  examine  the  basic  assumptions  of  the  so-called  classic 
theory  of  statistical  estimates,  i.  e.,  a  theory  formulated  several  decades  ago, 
and  then  we  will  explain  how  this  theory  applies  to  solving  the  problem  of  the 
reception  of  analog  messages  on  a  noise  background.  Use  of  the  latest  theory 
of  statistical  estimates,  the  foundation  for  which  was  laid  in  the  works  of  A. 

Wald  [30],  is  examined  in  Chapters  17  and  18. 


In  the  classic  theory  of  estimates,  the  problem  is  formulated  in  the  following 


manner: 


A  sample  of  n  random  magnitudes  y^f  y2»  •  •  . »  yn  exists  (they  need  not  be 
statistically  independent)  with  a  precisely-known  law  of  distribution,  except 
for  some  parameters  «i*  at,  ....  am  .  For  simplicity,  initially  we  will  formulate 
the  problem  relative  to  the  case  when  only  one  distribution  GC  parameter  is  unknown. 
This  signifies  that,  for  a  given  Ct ,  the  following  n-dimensional  probability 
density  is  known 


(16.1) 
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The  task  is  to  establish  the  magnitude  of  parameter  CX  based  on  the  given  sample 
(yl»  *  *  •»  yn). 

The  assumption  is  that,  in  the  process  of  the  given  sample,  parameter  CX 
remains  unchanged.  During  transition  from  one  sequence  to  the  next  (i.  e. ,  from 
one  sample  to  another),  desired  parameter  CX  may  change  like  a  random  magnitude 
or  remain  unchanged. 

Since  sample  values  y^,  .  .  . ,  yn  are  random  magnitudes,  and  sample  size 
n  is  finite,  then  it  is  impossible  to  establish  the  parameter  CX  magnitude  with 
sufficient  accuracy  based  on  sample  analysis.  It  is  possible  only  to  draw  an 
approximate  conclusion  as  to  its  magnitude  or,  as  they  say,  to  make  an  estimate. 

Here,  the  following  approaches,  which  differ  in  principle,  to  estimating  the  needed 
parameter  are  possible: 

1.  Point  estimate. 

2.  Interval  estimate. 

In  a  point  estimate,  some  value  CX  ,  being  a  function  of  sample  values  (y^, 

.  .  .,  y  )  and  accepted  as  the  true  parameter  value,  is  supplied  as  a  result  of 
n  ^ 

sample  analysis.  This  value  CX  is  called  the  parameter  CX  estimate. 

Function  type 

. Vn)  (16.2) 

is  selected  so  that  estimate  ££*  is  as  close  as  possible  (in  some  predetermined  /281 
sense)  to  true  value  OC  ,  i.  e. ,  so  the  error 

8  *=<*•—«.  (16.3) 

arising  during  the  estimate  is  as  small  as  possible. 

As  a  result  of  the  randomness  of  OC  *  and  OC  ,  error  (5  also  is  a  random 
magnitude,  which  changes  from  one  sequence  to  another.  Therefore,  the  complete 


composite  law  of  distribution  of  magnitudes  CX*  and  CC  characterizes  the  quality 
of  the  estimate 

P{ «*.  a)=*P(a)P„(a*).  (16.4) 

Howevert  in  msny  cases,  a  priori  distribution  P  (*)  of  needed  parameter  CX 
is  unknown,  while,  in  several  cases,  parameter  CX  is  in  no  way  random  due  to  its 
physical  nature.  In  these  cases,  distribution  P*  (a*)  of  estimate  CX*  for  a 
given  parameter  CX  value  must  suffice  instead  of  inconditional  distribution 
P  (t x*,  a)  (this  law  must  be  found  for  all  possible  values  of  CC  ).  But,  in  several 

cases,  it  is  very  difficult  or  even  practically  impossible  to  find  law  of  distribution 
Pt  (**)  . 

Consequently,  in  such  cases,  the  quality  (validity)  of  the  estimate  CX*  found 
turns  out  to  be  fully  or  partially  known.  This  circumstance  is  the  main  drawback 
inherent  in  a  point  estimate.  However,  as  will  be  explained  subsequently,  given 
a  good-enough  estimate  approach  [i.  e. ,  given  sufficiently  successful  selection 
of  the  type  of  function  (16.2)]  and  large  sample  size  (n  >1),  law  of  distri^wt 
Pa  (a*)  turns  out  to  be  normal  or  close  to  normal  and,  consequently,  error  6 
sufficiently  is  characterized  by  its  mean  value 

«)— /W«®*— ®  (16.5) 

and  mean  square 

3J=*AM®* — «)*•  (16.6) 

Here,  index  CC  denotes  that  averaging  is  conditional,  i.  e. ,  done  given  that 
magnitude  CC  is  unchanged. 

Magnitude  also  is  called  estimate  bias.  Evidently,  the  less  bias  ^ 
and  mean  square  ,  the  higher  the  quality  of  the  estimate. 

In  many  rases,  it  is  considerably  simpler  to  compute  parameters  £  and 

than  it  is  to  find  law  of  distribution  (a*)  .  Therefore,  large  samples 


(n  >  1)  and  a  good  estimate  method  significantly  ameliorate  the  aforementioned 
point  estimate  shortcoming. 


But,  given  a  correct  estimate  method,  an  increase  sample  n  size  provides 
an  increase  in  estimate  accuracy.  Therefore,  it  also  is  possible  to  assert  that 
the  more  accurate  the  estimate  with  respect  to  problem  conditions,  the  more  basis 
exists  for  use  of  a  point  estimate. 

If,  on  the  other  hand,  according  to  problem  conditions,  the  sample  size  /282 
is  small  and  the  estimate  may  not  be  precise  enough,  use  of  a  point  estimate  usually 
turns  out  to  be  inadvisible  since,  here,  due  to  the  aforementioned  shortcoming, 
one  cannot  obtain  sufficiently-valid  data  concerning  the  quality  of  estimate 
CC  found.  In  such  cases  (given  small  samples),  the  interval  estimate  method 
usually  is  used  in  statistics.  This  approach  is  based  on  introduction  of  so-called 
confidence  intervals. 

In  this  approach,  the  result  of  the  analysis  of  sample  (y^,  .  .  .,  yp)  is 
not  the  specific  magnitude  G(  of  the  needed  parameter  (as  is  the  case  in  a  point 
estimate) ,  but  only  interval  which  will  comprise  needed  magnitude 

with  a  probability  equalling  some  preselected  magnitude  (1—0,  called  the  confi¬ 
dence  coefficient.  The  latter  is  close  to  unity  (0.99,  for  example),  i.  e., 

€<1,  in  order  to  obtain  sufficiently-high  confidence  in  the  estimate. 

Interval  a,  -f*  a,  is  called  the  conf  idence  interval  corresponding  to  a  given 
confidence  coefficient. 

This  estimate  mathematically  boils  down  to  the  following.  Two  sample  value 

functions  a,  (ft. ....  yn)  and  . . .  are  selected  so  that  the  following  relationship 

is  satisfied  for  any  value  of 


P la,  (y, . yH)<a<a , (y, . y„)l -  l  —  «•  (16.7) 

Here,  as  usvbI,  P(  ]  denotes  the  probability  that  inequalities  enclosed  in  the 
square  brackets  are  satisfied. 

Since  sample  values  y^,  .  .  .,  yn  are  random  magnitudes,  then  limits 
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and  CC  2  of  the  conf idence  interval  also  are  random  magnitudes  changing  from  one 
test  (one  sample)  to  another. 

Needed  parameter  CC  by  nature  does  not  have  to  be  random  and,  in  relationship 
(16.7),  parameter  CC  is  considered  simply  some  fixed  magnitude  irrespective  of 
whether  it  is  random  or  not.  Therefore,  one  should  treat  the  left  portion  of 
relationship  (16.7)  as  the  "probability  that  interval  a,  t-o,»  includes  needed 
parameter  CC  "  rather  than  the  "probability  that  CC  will  be  found  within  the  limits 
of  interval  a‘  ‘r  0l  " . 

The  form  of  functions  a,  [yu  ....  yn)  and  a,  (i/, . yn)  should  be  such  that,  for 

a  given  confidence  coefficient  0  ~  e) ,  interval  (ai  *5*  cti)  will  be  as  small  as 
possible.  This  interval  is  a  random  magnitude  changing  from  sample  to  sample. 
Therefore,  we  have  no  assurances  that,  in  all  tests,  this  interval  will  be  small. 

But,  a  drawback  of  this  type  is  inherent  in  a  point  estimate  too,  since  point 
estimate  error  6  also  in  individual  cases  may  not  be  slight. 

The  advantage  of  the  interval  estimate  method  is  that,  as  a  result  of  the 
estimate,  i.  e.,  after  finding  interval  ,  we  obtain  not  only  the  estimate 

itself  (i.  e.,  an  indication  that  needed  parameter  CC  will  be  found  within  the 
limits  d|i«,  found),  but  also  we  know  the  quality  of  the  resultant  estimate 
beforehand — we  know  that  the  probability  that  needed  parameter  CC  is  encompassed 
by  resultant  interval  equals  previously-established  (and,  consequently,  /283 

acceptable)  magnitude  0  —  *) .  Therefore,  the  confidence  interval  approach  is 
applicable  for  small  samples  as  well. 

It  follows  from  the  aforementioned  point  estimate  and  interval  estimate  pro¬ 
perties  that  the  point  estimate  is  more  convenient,  given  large  samples  (n  >  1), 
which  allow  slight  point  estimate  error  S*  .  On  the  other  hand,  an  interval  estimate 
usually  turns  out  to  be  more  convenient  for  small  samples. 

Both  estimate  approaches  also  are  used  to  solve  problems  of  reception  of 
signals  on  a  noise  background.  However,  the  point  estimate  is  used  significantly 
more  often  than  is  the  interval  estimate.  This  is  because,  first,  signal  (message) 
reception  usually  requires  very  high  message  reproduction  accuracy  and  resultant 
sample  size  n  is  so  large  that  the  main  point  estimate  shortcoming  mentioned  above 


barely  manifests  itself.  Second,  the  result  of  the  estimate  when  reproducing 
signals  (or  messages)  usually  must  be  provided  automatically  and  continuously 
at  receiver  output.  Here,  naturally,  it  is  less  convenient  (and  sometimes  simply 
unacceptable)  to  form  in  a  receiver  two  functions  at  (ylt  ....  yn)  and  «*  (j/t,  •••.  yn) 
and  to  obtain  interval  aj-i-a,  than  it  is  to  form  and  provide  magnitude 
«*  iHu  •••.  yn)  at  output.  Therefore,  in  future,  we  will  limit  ourselves  to  examination 
only  of  a  point  estimate.  Here,  for  simplicity,  we  will  discard  the  term  "point" 
and  will  use,  as  is  the  convention  in  foreign  literature,  the  single  term  "estimate" 
for  the  following  concepts:  "estimate  process,"  "estimate  method"  [function 
a*(yu  •••.  yH)  type],  and  "estimate  result." 

For  example,  if  one  says  that  CC*  is  an  efficient  estimate,  then  this  denotes 

that  the  estimate  process,  estimate  method  [function  a*  Q/i . yn)  type],  and 

estimate  result  are  efficient. 


16.2  Basic  Relationships  During  a  Point  Estimate 

If  certain  sufficiently-general  differentiability  conditions  [23]  are  met, 
then  an  estimate  is  called  sufficient  and  the  mean  square  8^  of  the  error  arising 
during  the  estimate  satisfies  inequality 


M 


(16.8) 


where  L  —  function  determined  from  relationship  (16.1)  and  called  the  likelihood 
function. 

If  estimate  C£*  is  such  that  there  is  an  equal  sign  in  relationship  (16.8), 
then  such  an  estimate  is  called  efficient. 

Consequently,  for  an  efficient  estimate,  mean  square  error  8J?  has  the  minimum 
possible  value  and  this  value  is  determined  from  the  right  side  of  inequality  /284 
(16.8).  However,  an  efficient  estimate  exists  when  and  only  when  likelihood  function 
Hyt,  ....  yn  i  '/(satisf ies  two  special  conditions  [23].  We  must  shift  from  random 
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variables  y^,  .  .  . ,  yp  to  random  variables  5 . .  <x*  linked  on  a  one-to-one 

basis  to  them  in  order  to  formulate  these  conditions. 

Let  tf‘i>  "  i  f n  be  the  composite  probability  density  of  variables 

£ .  &»-»  for  givens  OC  and  OC  f  while  A(o*jjt)  —  probability  density  of 

estimate  CC  for  a  given  CC  value.  Then,  the  aforementioned  two  conditions  have 
the  following  form: 


A.  function  h  (|„  ....  a)  will  not  depend  on  CC  . 


(16.9 


This  inequality  is  satisfied 


3_lnA(g|«) 

dfX 


(16.1 


where  k  does  not  depend  on  C(*  (but  may  depend  on  CC  ), 


If  only  condition  A  of  the  above  two  conditions  is  satisfied,  then  estimate 
CC  is  called  sufficient. 


It  follows  from  the  nature  of  conditions  A  and  B  that  efficient  and  even 
sufficient  estimates  may  exist  in  far  from  all  the  cases  of  practical  interest 
to  us.  Therefore,  in  the  general  case,  inequality  (16.8)  makes  it  possible  to 
determine  only  the  lower  mean  square  error  limit. 

Along  with  mean  square  error  8?,  bias  magnitude  5^  (see  (16.5)]  also  is 
an  important  characteristic  of  estimate  CC  quality. 

Estimate  UL  having  zero  bias  is  called  unbiased.  Consequently,  these 
relationships  are  valid  for  an  unbiased  estimate 


8^  =  0,  i.  e. ,  AT»«*' 


(16.11) 


It  follow8  from  relation8hips  (16.8)  and  (16.11)  that  inequality  (16.8)  is 
simplified  for  unbiased  estimates  and  takes  the  form 


When  sample  size  n  increases,  estimate  quality  improves;  therefore,  when 
«-*■  oo  ,  estimate  (X  may  in  some  cases  acquire  those  valuable  properties  (ef¬ 
ficiency  and/or  nonbias),  which  it  lacked  when  n  was  small. 

An  estimate,  which  will  become  unbiased  when  n  -►  oo  f  is  called  consistent 
while  an  estimate,  which  becomes  efficient  when  n  -*»  oo  ,  is  called  asymptotical! 
efficient. 


Finally,  in  many  cases,  the  estimate  Ct*  law  of  distribution  will  become 
normal  when  n  -*■  oo  .  Such  Q(  estimates  are  called  asymptotically  normal. 

The  aforementioned  determinations  may  be  formulated  in  the  following  way: 

Estimate  CC  is  consistent  if  inequality  (16.11)  is  satisfied  when  /2B5 
n  -*■  oo  > 

Estimate  0(  is  asymptotically  efficient  if  mean-square  error  3a1  attains 
the  minimum-possible  value  determined  by  the  ricfit  side  of  inequality  (16.8)  when 

n  -*•  oo  . 


If  sample  values  y^,  .  .  .,  yR  statistically  are  independent,  then 


L  (JA. ... .  !fn  I «)  •/  (Sft  I <*)  f  (!/t  I  *> ...  /  0/n  I  *). 


(16.13 


where  f(y^  CO  —  unidimensional  probability  density  of  magnitude  y.  for  a  given 

a. 


Here,  inequality  (16.8)  takes  the  form 


Magnitude  AJ.  jlii-illLSil  j  also  ny  be  represented  in  the  form  of  the  following 
expressions: 

m.  [iiiUBja]1 .  j ,  ^  pmi*!  ]*  . 

-i  n&m* 

All  the  aforementioned  reletionehipe  applied  to  a  case  when  the  distribution 
of  samples  (y^,  .  .  . ,  yn>  will  comprise  only  one  unknown  parameter  CC . 

Now,  we  will  examine  s  esse  when  this  distribution  has  two  unknowns,  &  ^ 
and  CC  2>  In  this  case,  the  likelihood  function  has  the  form 

.  yja„  a,).  (16.1 

We  will  assume  that  both  unknowns  will  be  subject  to  estimation  and  we  will 
designate  the  results  of  the  estimate  snd  CC  * , 

Estimates  CC  *  and  ^2*  are  “"P1®  value  functions,  i.  e., 

Vn)  and  «* »*“«*•(», . yn).  (16.1 

For  notstionsl  simplicity,  we  will  limit  ourselves  to  a  case  of  unbiased 
estimates.  Here, 


cu*  *  a,; 


(16.1 


and,  consequently. 
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8a,  =»  M«,  (oi  —  a  I  )*  =  /M«,  (a!  —  /Ma,  a  I  )l  =  °1, 
8l,  =  Ma,  (o»  — O*)*  =  Ma,  (<**  —  Ma.O*)*  =  o«,  * 


(16.20) 


i.  e.,  in  a  case  of  unbiased  estimates,  msan  square  errors  and  8l”  coincide 
with  variances  °a,  and  0a, of  these  errors  (here,  both  mean  squares  and  variances 
are  conditional,  i.  e.,  found  through  averaging  for  given  CC ^  and  CC  ^  values). 

During  estimation  of  two  parameters  and  CC^,  besides  error  <r£,  and 
Oa,  variances,  often  it  is  important  to  know  mutual  correlation  factor  Pa,,  a,  of 
these  errors,  which  for  unbiased  estimates  by  definition  equals 

i*— «i) 

Pa, .a,  ~  0  g  (16.21) 

«•  •• 


Based  on  parameters  ffa,,  oa,  and  Pa,,  a,  ,  it  is  possible  to  plot  a  dispersion 
ellipse  determined  in  the  case  of  unbiased  estimates  (in  x,  y  coordinates)  from 
this  equation 


(*-««)*  2P, 


itiSa  (jc — c*i)(y — <*»)-+- 


(*-«•)* 


■  4  (l  — Pa, .a,)* 


(16.22) 


a* 

(16.22)  takes  the  form 


and  CC  * 


are  uncorrelated  (  Pa,,  a,  =  0),  then  equation 


(*.,)•  (K.)*  (16.22a) 


i.  e.,  ellipese  semiaxes  equal  2o«,  and  2oa,  . 

The  dispersion  ellipse  characterizes  the  spread  of  estimates  CC  *  and 
^  2*  relative  to  the  true  values  of  parameters  CC  ^  and  CC 

If  the  dispersion  ellipse  corresponding  to  a  given  method  of  estimating  para¬ 
meters  CC  ^  arKj  a2  is  located  entirely  within  the  dispersion  ellipse  corresponding 
to  some  other  method  of  estimating  the  same  parameters,  then  they  say  that  a  lesser 
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spread  corresponds  to  the  given  estimate  method  or  that  the  distribution  of  the 
estimates  of  CX  ^  and  Ct  ?  found  this  way  has  a  lesser  spread. 

If  the  joint  distribution  of  estimates  OC  ^  and  OC  *  has  a  lesser  spread 
than  the  distribution  of  any  other  pair  of  estimates  of  the  same  parameters 
(  Ct  i  and  Ct  ,  then  estimates  Ct  ^  and  Ct  ^  are  called  .jointly  efficient. 

Consequently,  that  method  of  estimating  parameters  and  that  insures 
jointly-efficient  estimates  result  thereby  guarantees  that  minimum-possible  spread 
results. 

The  following  relationships  are  valid  for  independent  sample  values  /287 
y^,  .  .  . ,  yn  for  jointly-efficient  estimates: 


fa.  .a. 


M  (dinfdlnf\ 
_ l  dott  da,  I 

VRWHImWT] 


dln/\*  ’ 


(16.23 


0-Pi, .J 


where  /  —  /  «*)—  uni  dimensional  distribution  of  sample  values  y.  for  given 

OC  ^  and  CC 

Analogous  relationships  may  be  obtained  also  for  the  case  when  a  statistical 
link  exists  among  the  sample  values  [23,  pgs  539-340]. 

It  followa  from  formulas  (16.23)  that,  when  =  0,  i.  e. ,  when 


* . 


(16.24 


and  given  jointly-efficient  estimates,  the  quality  of  the  estimate  of  each  of 
these  parameters,  for  example,  during  simultaneous  estimation  of  two  parameters 


is  the  same  as  if  the  second  parameter  OC  2  was  known  and,  consequently,  would 
not  be  subject  to  estimation. 


In  other  words,  when  the  aforementioned  conditions  are  met,  whether  or  not 
the  second  parameter  OC  2  is  known  or  unknown  and  subject  to  simultaneous  esti¬ 
mation  along  with  parameter  OC  has  no  impact  on  the  estimation  of  given  parameter 

ar 

However,  for  existence  of  jointly-efficient  estimates,  two  conditions  analogous 
to  the  aforementioned  conditions  A  and  B  (but  more  complex)  for  existence  of  efficient 
estimates  [23]  must  be  met.  Therefore,  it  may  tum  out  to  be  impossible  in  principle 
to  obtain  jointly-efficient  estimates  in  a  number  of  cases  of  practical  interest 
to  us.  In  addition,  condition  (16.24)  may  not  be  met,  even  for  existence  of  such 
estimates.  Here,  as  follows  from  formulas  (16.23),  variance  c£,  (or  <t«,  )  in  the 
case  of  a  simultaneous  estimate  of  parameters  GC^  and  GC  2  *s  greater  than  in 
a  case  when  parameter  QC^  is  the  only  unknown. 

In  many  cases,  when  distribution  L  (t/„  <**)  has  two  unknowns,  the 

task  is  to  estimate  the  magiitude  only  of  one,  CX  ^  for  instance,  while  GC  2  carries 
no  usable  information,  i.  e.,  is  parasitic. 

Therefore,  it  is  very  important  to  respond  to  the  next  question:  is  it 
impossible,  given  unknown  parameters  OC  ^  and  GC  2,  to  obtain  lesser  variance  "2, 
in  the  estimate  of  parameter  0^  if  there  is  no  requirement  here  also  to  /288 
estimate  parameter  OC  2,  i.  e. ,  if  one  assures  that  variance  °a.  may  be  as  great 
as  desired  here? 

In  the  general  case,  it  is  possible  to  obtain  such  a  gain  throucfi  refusal 
to  estimate  the  second  parameter.  However,  it  was  demonstrated  in  [23]  that, 
in  this  case,  when  (given  simultaneous  estimation  of  two  parameters)  resultant 
estimates  (  OC  *  and  OC  2*)  are  jointly  efficient,  refusal  to  estimate  parameter 
OC  2  does  not  make  it  possible  to  decrease'*’  variance  in  the  parameter  OC  ^ 
estimate. 


*See  the  page  423  footnote. 


This  denotes,  roughly  speaking,  that  if  resultant  estimate  quality  is  the 
best  of  the  possible  estimates  during  simultaneous  estimation  of  parameters 
a  and  OC^,  ^en  *s  i^P033^!®  to  improve  upon  the  best  quality  of  the  parameter 
(X  i  estimate  by  refusing  to  estimate  parameter  CX  ^ 

The  concepts  of  asymptotic  (where  n-*oo)  )  efficiency  and  an  asymptotically- 
efficient  estimate  exist  when  estimating  two  parameters  analogously  to  the  case 
of  estimating  one  parameter.  In  addition,  the  results  achieved  above  remain 
(qualitatively)  valid  also  in  those  cases  when  a  statistical  link  exists  among 
sample  values  (y^,  ...»  yp). 

The  examination  above  for  two  unknown  parameters  may  be  generalized  for  a 
case  of  a  random  (finite)  number  of  unknown  parameters  («i»  ** . «m). 

16.3  Maximum  Likelihood  Method 

The  material  presented  in  §  16.2  applies  to  existence  of  efficient  estimates 
and  makes  it  possible  to  compute  the  magnitude  of  the  efficient  estimate  variance. 
However,  it  does  not  respond  to  the  question  of  how  it  is  possible  to  find  efficient 
or,  in  any  event,  sufficiently-good  estimates,  i.  e. ,  how  one  goes  about  selecting 
the  form  of  function  Cifa*  . 

There  are  several  methods  in  mathematical  statistics  for  giving  estimates. 

The  most -widespread,  especially  as  applied  to  problems  of  receiving  signals  on 
a  noise  background,  is  the  so-called  maximum  likelihood  method.  It  is  the  most 
widespread  because  other  methods  either  provide  in  the  general  case  less  efficient 
estimates  (for  example,  the  moment  method)  or  is  more  complicated  for  realization 
[for  example,  the  method  based  on  determination  of  the  X-coordinate  of  the  "center 
of  gravity"  of  distribution  P,  (*)  ]. 

The  maximum  likelihood  method  for  a  case  of  one  unknown  parameter  GC  involves 
the  following. 

That  value  of  parameter  CC  at  which  likelihood  function  ^  —» has 

;  consequently,  needed  estimate  Ct  is  solution  of 

the  equation 


9 


Si 
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called  the  likelihood  equation.  This  equation  in  the  general  case  may  have  several 
roots,  including  roots  not  depending  on  (y^,  ...»  yp).  Evidently,  roots  /289 
not  depending  on  (y.»  .  .  .,  y  )  must,  a  fortiori,  be  discarded,  since  it  is  mandatory 

A  n*  n 

that  needed  estimate  W.  be  a  function  of  the  sample  values. 


An  estimate  found  by  solving  equation  (16.25)  is  called  a  maximum  likelihood 
estimate  and  designated  CC  . 

In  the  general  case,  estimate  OC  is  not  the  best  of  the  possible  estimates 
and,  consequently,  may  have  some  bias  and  no  minimum  variance.  In  spite  of  this, 
th»  maximum  likelihood  method  has  several  very  valuable  properties.  They  are 
especially  valuable  in  those  cases  when  sample  values  y^,  .  .  . ,  yR  are  statistically 
independent.  Here,  the  following  postulations  demonstrated  in  mathematical  statistics 
[23]  are  valid: 

1.  If  efficient  estimate  (X*  exists  for  parameter  ,  then  maximum  likelihood 

.ft 

equation  (16.25)  has  a  single  solution  for  CC  .  In  other  words,  in  this  case, 
equation  (16.25)  has  only  one  root,  which  is  the  efficient  parameter  estimate. 

2.  If  sufficient  estimate  (X*  exists  for  parameter  CC ,  then  each  likelihood 
equation  root  is  a  function  of  CC  . 

3.  For  some  sufficiently-general  conditions  [23,  pg  54',  conditions  1— -3], 
estimate  CC  is  consistent,  asymptotically  efficient,  and  asymptotically  normal. 

This  last  property  is  especially  valuable  since  it  occurs  for  significantly-more 
general  cases  than  do  properties  1  and  2.  It  follows  from  this  that,  given  indepen¬ 
dent  samples  and  n  >1,  the  most-likely  estimate  is  the  very  best — it  has  essentially 
zero  bias  and  minimum  variance.  In  addition,  since  here  the  esti'..~ce  law  of  distri¬ 
bution  is  very  close  to  normal,  variance  magnitude  completely  characterizes  this 
law.  This  guarantees  that  the  estimate  found  is  very  good,  not  only  in  the  sense 
of  mean-square  error  magnitude,  but  also  in  any  other  sense. 


418 


Finally,  a  very  useful  special  feature  of  the  maximum  likelihood  method  is 
that,  in  this  case,  finding  estimate  (Jt  requires  determination  only  of  the  position 
of  the  function  L  (ylt  ....  yn\ a) maximum  with  respect  to  Ct  ;  therefore,  any  trans¬ 
formations  of  the  function  Hyr . yn |a)  form  which  do  not  cause  bias  (with  respect 

to  the  CX  axis)  of  its  maximum  do  not  impact  on  estimate  a**,  magnitude  and, 
consequently,  on  its  quality.  This  circumstance  considerably  simplifies  realization 
of  the  device  automatically  estimating  the  parameter. 


The  results  presented  easily  are  extended  to  a  case  of  several  unknown  parameters 
a,,  am.  Here,  estimates  a,**.  ....  cun*  of  these  parameters  are  solutions  of 
m  likelihood  equations  of  the  type 


• .  Vk  1  . . am)  _  f). 

dot  | 

dL  (</,.. 

•  ••  l/i»  !*!•••••  Bnt) _ rj 

(16.26) 


Estimates  found  in  this  way  for  independent  sample  values  y^,  •  .  . ,  yn  /290 
have  the  same  useful  properties;  for  example,  they  are  consistent,  asymptotically 
normal,  and  asymptotically  jointly  efficient. 


16.4  Use  of  the  Maximum  Likelihood  Method  for  Reception  of  Analog  Messages  on 
a  Noise  Background 


Let  oscillation  y(t)  at  receiver  input  have  the  form 


«/(/)=*«  (a, .  am;  0 +  «■(*)• 


(16.27) 


where  (0  —  fluctuating  noise,  in  the  general  case  having  a  nonstationary  process, 
a,,  am  —  unknown  signal  parameters,  which  in  the  general  case  may  be  time 
functions. 


Initially,  for  simplicity  we  will  assume  that  all  these  parameters  are  useful 
and  subject  to  estimation. 

It  follows  from  the  preceding  section  that  it  is  necessary  (as  a  rule)  to 
have  a  sufficiently-large  number  n  of  independent  sample  values  y.,  .  .  . ,  y 
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Figure  16.1 


Figure  16.2 


of  observed  magnitude  y  to  use  the  maximum  likelihood  method  to  obtain  good-enoutji 
estimates.  However,  in  this  case,  we  are  concerned  with  observation  of  a  random 
process,  which  also  happens  to  be  nonstationary,  rather  than  a  random  magnitude. 
Therefore,  in  the  general  case,  instead  of  n  independent  values  y^,  ...»  yn, 
we  must  have  n  realizations  yi((),  yn(t) .  yH(i)  of  process  y(t)  obtained  in  n  inde¬ 

pendent  tests  conducted  under  identical  conditions  (Figure  16.1). 

Here,  for  each  moment  t  =  ti  (Figure  16.1),  we  will  have  likelihood  functions 

—  .  yn  |0|, ... ,  am;  t), 


called  a  simultaneous  likelihood  function  since  all  sample  values  y^,  .  .  . ,  yn 
included  in  it  are  taken  from  n  realizations  for  the  identical  moment  in  time 
(Figure  16.1). 


Solving  m  likelihood  eqiatio  s  of  the  type 


dL  (<n , , 

•  •  i  0  _  n. 

fat 

•  «•••< 

•It  j)lll#l..' 

Vl 

1  •  •  ®mi  0  n 

dam 

(16.28) 


we  will  obtain  most-likely  estimates 


....  am*  for  a  given  moment  in  times 
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*1  =«I  (/); ... ;  am=am(t). 


F or  the  next  moment  in  time,  we  will  obtain  a  new  value  of  the  simultaneous 
likelihood  function,  new  set  of  likelihood  equations,  and  new  set  of  estimates 

®  I  ♦  •  •  *  *  ■ 

In  the  general  case,  when  needed  parameters  «i.  •••.  are  time  functions  /291 
and  noise  is  nonstationary,  resultant  estimates  «»*.  •••.  «m*also  are  time  functions 
and,  in  addition,  the  law  of  distribution  of  these  estimates  also  will  depend 
on  time. 

As  Grenander  demonstrated  [113],  estimates *»*•  •••.  *«*obtained  by  solving 
likelihood  eqiations  (16.28)  have  [given  independent  yt  (<).  ....  y*  (0  realizations] 
the  useful  asymptotic  (where  «-*■»)  properties  described  in  §  16.3,  i.  e., 
are  consistent,  asymptotically  normal,  and  asymptotically  jointly  efficient. 

However,  in  practical  problems  of  receiving  signals  on  a  noise  background, 
the  observer  (receiver)  has  at  the  point  of  reception  usually  just  one  realization 
of  the  y(t)  process,  rather  than  n  independent  realizations  of  this  random  process. 
Therefore,  of  primary  interest  are  cases  when  good  parameter  estimates  may  be 
obtained  from  analysis  of  just  one  realization  y(t)  during  some  finite  time  T. 

Grenander  [115]  demonstrated  that  most-likely  estimate?  obtained  from  analysis 
of  just  one  realization  in  interval  (0,  T)  have  the  same  useful  asymptotic  properties 
(consistency,  efficiency,  and  normal  law  of  distribution)  when  T -► <»  if  process 

y(t)  is  ergodic  and  have  certain  general  Markov  properties.'*' 

As  Slepian  demonstrated  [119],  in  particular,  a  case  when  noise <*n(/)is  an 
ergodic  process,  while  unknown  signal  parameters  o„  ....  am  are  consta  t  during 
the  time  of  observation,  boils  down  to  such  processes. 

+A  random  process  has  Markov  properties  if  the  last  course  of  this  process  only 
to  a  limited  degree  depends  on  its  preceding  flow.  This  restriction  placed  on 
the  nature  of  the  process  is  formed  mathematically  in  the  following  manner  [11, 

113,  and  others].  It  is  considered  that  a  process  has  all  the  more  general  Markov 
properties,  the  less  rigid  the  restriction  is  relative  to  the  degree  of  influence 
of  the  process' s  "history"  on  its  subsequent  development. 


Since  this  case  is  of  great  practical  interest,  we  will  examine  it  in  /292 
somewhat  more  detail.  Here,  for  simplicity,  we  will  limit  ourselves  to  a  case 
of  one  unknown  parameter,  i.  e. ,  we  will  assume  that 


y(t)=u(a;  0+“m  (0. 


(16.29) 


where  «m  (/)  —  noise,  being  an  ergodic  process,  “(«;/)  —  precisely-known  signal, 
with  the  exception  of  parameter  G(  subject  to  estimation  and  remaining  unchanged 
throughout  the  entire  time  of  observation  (0,  T). 

There  are  n  sample  values  (0i*  0i*  •••>  0")  of  total  oscillation  y(t)  during 

an  observation  cycle  (Figure  16.2). 

We  will  designate: 


Ui  =  m( a;  /,). 

Smi  ~  Wyi  (/|)t 

yi=y(*  i)- 


(16.30) 


Then,  from  (16.29),  we  have 


0l=«<  +  «,n|. 
i  1  •  2, ... ,  fi« 


(16.31) 


Here,  the  r>- dimensional  law  of  distribution  of  noise  («m,  ....  um *)  is  assumed 
to  be  precisely  known. 

Then,  in  accordance  with  (16.31),  the  law  of  distribution  of  sample  values 
(yv  .  .  .,  yp)  for  a  given  CC  equals 

L  Utl . 0„  I «)  -*  (01  — « I .  0„ 


i  •  0#  i 


/(0. . [0i -«(•»:  t i)r 

0*— «  (a; ./,) .  0»-u(a;  /,)|. 


(16.32) 


Since  u  (a;  tt)  —  known  functions  of  Ct  ,  then  the  right  side  of  equality 
(16.32)  is  a  precisely-known  function  of  sample  values  (y^,  ...»  yn)  and  parameter 

a  ;  consequently,  we  know  distribution  L  (y, . y„|a)and  we  must  find  parameter 

C^of  this  distribution. 

This  problem  coincides  completely  with  the  classic  problem  examined  in  preceding 
sections  solved  in  the  theory  of  statistical  estimations  and,  consequently,  all 
conclusions  presented  in  preceding  sections  are  valid  for  it. 

Thus,  for  example,  when  T -*■  oo  ,  it  is  possible  to  obtain  an  infinite  number 
of  statistically-independent  sample  values  y^,  .  .  .,  yn  and,  consequently,  when 
T-*-oo  ,  estimates  obtained  using  the  maximum  likelihood  method  must  be  consistent, 
asymptotically  efficient,  and  asymptotically  normal. 

Evidently,  all  computations  performed  above  for  a  case  of  one  unknown  parameter 
are  valid  also  for  any  (finite)  number  of  unknown  parameters  a,,  ....  am  .  Consequently, 
given  ergodic  noise  and  constancy  [during  interval  (0,  T)]  of  unknown  signal  /293 
parameters  a,. ....  am  ,  all  computations  and  conclusions  made  in  preceding  sections 
are  applicable  completely  to  estimation  of  these  parameters. 

Thus,  for  example,  if  a  jointly-efficient  combination  («j*,  ....  am*)  of  parameter 
(otj,  ....  am)  estimates  exists,  then  variance  of  the  estimate  of  any  of  these  para¬ 
meters  CXi  may  not  be  decreased  due  to  refusal  to  estimate  all  or  several  remaining 
unknown  parameters.'*' 

This  conclusion  is  especially  important  in  practice  since,  in  many  cases, 
reproduction  of  various  signal  parameters  has  varied  significance,  while  several 
of  the  parameters  carry  no  useful  information  of  any  kind,  i.  e.,  are  parasitic. 

Thus,  for  instance,  in  radbr  it  is  very  important  to  know  whether  it  is  possible 
to  increase  accuracy  in  measurement  of  target  range  (i.  e. ,  signal  lag  x  )  if 
one  fails  simultaneously  to  measure  target  radial  velocity  (i.  e.,  signal  frequency 
f)  and  vice  versa. 

+This  and  conclusions  stemming  from  it  apply  to  a  case  when  signal  characteristics 
are  given,  i.  e.,  may  not  be  changed,  given  an  estimate  refusal. 


The  theory  presented  above  makes  it  possible  in  several  cases  to  anauer  this 
question. 

For  example,  let  the  range  and  velocity  estimate  be  made  using  the  maximum 
likelihood  method  and  observation  time  T  be  so  great  that  it  is  possible  to  obtain 
a  very  large  number  of  independent  sample  values  (y.,  .  .  . ,  y  ).  Here,  as  follows 

1  n 

from  the  aforementioned,  resultant  estimates  and  fM  may  with  sufficient 
precision  (rising  with  an  increase  in  T)  be  considered  jointly  efficient.  Con¬ 
sequently,  given  such  conditions,  refusal  to  measure  frequency  will  not  increase 
range  measurement  accuracy  and  vice  versa.  In  other  words,  it  is  immaterial  from 
the  range  measurement  point  of  view  whether  frequency  f  is  a  parasitic  unknown 
parameter  or  if  it  is  a  known  parameter  subject  to  measurement. 

If  a  joint  efficient  combination  of  estimates  («i*.  at*)not  only  exists  when 
there  are  two  unknown  parameters  OC  and  GC^,  but  condition  (16.24)  also  i3 
met,  i.  e.,  =  0,  then  the  formula  for  parameter  OC  measurement  error  variance 

ol,  during  simultaneous  parameter  CC  ^  and  CC  ^  measurement  is  identical  to  that 
in  a  case  where  parameter  CC  2  is  precisely  known.  Consequently,  when  the  afore¬ 
mentioned  conditions  are  met,  the  fact  that  parameter  GC  ^  is  unknown  or  the  re¬ 
quirement  to  measure  it  does  not  impact  on  parameter  GC  ^  measurement  accuracy 
(and  vice  versa). 

These  examples  show  that  use  of  results  obtained  from  mathematical  statistics 
for  signal  reception  problems  makes  it  possible  in  several  cases  relatively  simply 
to  obtain  answers  to  very-important  practical  questions. 

We  now  will  explain  the  link  between  the  maximum  likelihood  method  and  the 
maximum  inverse  probability  density  method  presented  in  preceding  parts  of  the 
book. 

In  the  case  of  the  maximum  likelihood  method,  the  parameter  CC  value  /294 
corresponding  to  the  likelihood  fuiction  L  {y„  |o)  maximum  or,  which  is  the 

same  thing,  conditional  probability  density  P_(y),  considered  a  function  of  CC 
[see  equality  (16.1)3,  is  selected  as  estimate  CC  . 

If  the  maximum  inverse  probability  density  method  is  used,  that  value  of 


CC  corresponding  to  the  inverse  probsbility  density  Pg  (“)  maximum  is  selected 
as  the  CC*  estimate. 

But, 

P,{a)  =  kP(a)Pm(y).  (16.33) 

where  k  —  normalizing  constant. 

If  the  needed  parameter  CC  a  priori  distribution  is  uniform,  then 

P,(a)=*'P.(iO.  (16.34) 

where  k*  —  some  new  value  of  the  normalizing  constant  not  depending  on  CC  . 

It  follows  from  (16.34)  that  function  Pg(o)  and  Pa  (j/) maxima  with  respect 
to  CX  coincide.  Therefore,  the  maximum  inverse  probability  density  method  and 
the  maximum  likelihood  method  completely  coincide  when  the  parameter  CC  a  priori 
distribution  is  uniform. 

If  distribution  P(a)  is  irregular,  then  the  function  Pg(a)  and  P«  (y)  maxima 
with  respect  to  CC  do  not  coincide  and,  consequently,  estimates  found  using  the 
maximum  inverse  probability  density  method  and  the  maximum  likelihood  method  turn 
out  to  be  different.  If  a  priori  distribution  P(a)is  known,  then  the  estimate 
found  using  the  maximum  inverse  probability  density  method4'  turns  out  to  be  more 
efficient  since  additional  a  priori  information  about  the  signal— a  priori  distri¬ 
bution  P(«)  of  its  parameter,  is  considered  here. 

However,  as  noted  in  Parts  II  and  III  of  the  book  and  as  will  be  shown  in 
Chapter  19,  given  noise  in  the  form  of  normal  white  noise  (and  not  only  given 
such  noise)  and  a  sufficiently-high  signal-to-noise  ratio,  the  type  of  a  priori 
distribution  P  (a)  essentially  does  not  impact  upon  the  results  of  the  parameter 

+Tha  maximum  inverse  probability  density  method  sometimes  is  called  the  uncon¬ 
ditional  maximum  likelihood  method  since,  compered  to  it,  the  conventional  maximum 
likelihood  method,  which  does  not  consider  a  priori  distribution  f(a)  ,  is  conditional. 
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estimate.  Therefore,  under  such  conditions,  the  methods  of  maximum  inverse  pro¬ 
bability  density  and  maximum  likelihood  also  provide  essentially  identical  results. 

If  a  priori  distribution  P(&)  is  unknown,  then  it  is  assumed  to  be  uniform 
in  the  majority  of  cases  and  the  function  Pu  (a)  and  P*  ( y )  maxima  with  respect 
to  CL  also  coincide. 

Finally,  if  it  is  known  that  parameter  C(  by  nature  is  not  random,  then 
a  priori  distribution  P  (a)  from  the  physical  point  of  view  loses  its  meaning  and 
only  likelihood  function  P*(y)  retains  meaning.  However,  since  function  /*„  (a)  / 295 
and  Pa  (y)  maxima  with  respect  to  CX  coincide  when  P(a)  =  const,  it  is  possible 
from  the  mathematical  point  of  view  and  when  parameter  CC  is  not  random  to  use 
the  maximum  inverse  probability  density  method  if  you  assume  that  P  (“)  =  const 
here. 

It  follows  from  what  has  been  said  that,  in  a  majority  of  the  interesting 
cases,  methods  of  maximum  inverse  probability  density  and  maximum  likelihood 
essentially  coincide. 

Evidently,  everything  stated  above  will  apply  not  only  to  a  case  of  estimating 
one  parameter,  but  to  cases  of  estimating  several  unknown  parameters  as  well. 

Examples  of  the  use  of  the  maximum  inverse  probability  density  method  (and, 
consequently,  the  maximum  likelihood  method  as  well)  for  estimation  of  one  parameter 
were  examined  in  detail  in  Parts  II  and  III  of  the  book  (Chapters  4,  6,  7,  and 
13). 

Use  of  the  maximum  likelihood  method  for  simultaneous  estimation  of  two  signal 
parameters  was  studied  in  detail  for  the  first  time  by  S.  Ye.  Fal'kovich  [5]. 

Some  results  from  this  study  will  be  presented  for  illustration  in  the  next  section. 

16. 5  Simultaneous  Signal  Frequency  and  Lag  Estimation 

Let  this  oscillation  be  at  receiver  input 
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where  um(t ) —  normal  white  noise  with  spectral  density  N I  j  u  (t)  —  signal,  ttfiich 

o  c 

has  the  form 

ue  (/)  *  a  (/— t)  cos  [2*  (/,— F)  t  +  <p  (0+9.1;  (16.35) 

here,  —  random  initial  phase  with  a  uniform  distribution  in  the  0  to  2n] 
range,  while  lag  t  and  frequency  shift  F  —  unknown  parameters  subject  to  estimation. 
Frequency  f  and  functions  9  (t)  and  a(t  —  t)  are  precisely  known  (the  latter, 
with  the  exception  of  %) .  The  assumption  is  that,  during  time  of  observation  (0,  T) 
i  cfi,.  Q,  f  ,  and  F  remain  unchanged. 

It  is  convenient  to  write  oscillation  (16.35)  in  the  following  form: 

«o(0«Re(/<f.  T.  F)yw++>.  (16.36) 


where 


U(i,  t,  f)«o(/— T)e/t-**jr,+^“» 


(16.37) 


is  complex  signal  envelope. 

In  the  particular  case  where  F  s  0  and  f(t)  =  0,  envelope  U(t,  ,  F)  is 
a  real  function. 


As  will  be  clear  from  what  follows,  accuracy  in  estimation  of  7  and  F  is 
determined  by  the  type  of  function 


¥(t„  Fv  t„  F,) 


U(t,  x„  Fi)U*(t,  x„  FJdt 


(16.38) 


where  U*  denotes  a  complex  magnitude  conjugate  with  U,  while  vertical  lines  /296 
show  that  the  modulus  is  taken  from  the  corresponding  complex  magnitude;  Q  — 
signal  energy,  i.  e. , 


Hu 


.*  v, 
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(16.39) 


q“t  f 

Function  4^  attains  the  greatest  value  (unity)  when  =  T^,  F^  =  F^  and 
decreases  (it  does  not  have  to  be  monotonically)  by  virtue  of  an  increase  in  the 
deviation  of  parameters  and  F  ^  from  and  F^,  respectively. 

Consequently,  4/  characterizes  the  link  between  complex  envelopes  corresponding 
to  the  two  different  unknown  parameter  combinations:  (  T^,  F^)  and  (  and 
F^).  Therefore,  function  4^  sometimes  is  called  the  signal  correlation  function. 
However,  to  avoid  confusion  with  the  correlation  function  used  in  the  theory  of 
random  processes,  we  will  call  function  4/  a  signal  function  or  a  function  of 
"correlation. " 


For  the  majority  of  actually-used  signals,  function  4/  will  depend  not  on 
values  Tv  F^  and  T^,  F^  themselves,  but  only  on  differences 


t'  — t,  and  **  — f, . 


(16.39a) 


Here,  relationship  (16.38)  takes  the  following  simplified  form: 


▼<«'.  n-± 


J  U(i,  f,  0. 0 )<u 


(16.40) 


Assuming  that  signal  function  4/  has  the  same  simplified  form,  while  the 
signal-to-noise  ratio  is  sufficiently  high,  Fal’kovich  used  the  maximum  likelihood 
method  to  obtain  the  following  expressions  for  parameter  T^and  F  estimate  error 


where 


(16.42) 
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Here,  mean  square  errors  6?  and  8?  and  correlation  coefficient  P«. *  have 
exactly  the  same  meaning  as  in  previous  sections. + 

In  the  case  under  examination,  estimates  *t*  and  F  of  parameters  T'and  F 
are  (asymptotically)  unbiased,  jointly  efficient,  and  with  a  normal  law  of  dis¬ 
tribution. 

* 

Since  estimates  X  and  F  are  jointly  efficient,  refusal  to  estimate  frequency 
F  may  not  increase  parameter  '^'measurement  accuracy  and  vice  versa.**  In  addition, 
relationships  (16.23)  must  be  valid  for  jointly-efficient  estimates. 

If,  during  the  parameter  T'estimate,  frequency  shift  F  is  precisely  known 
or,  vice  versa,  during  the  frequency  shift  F  estimate,  lag  T' is  known,  then  one 
should  assume  that  s  0  in  formulas  (16.41)  and  (16.42). 


The  result  here  1st 


(16.43) 


^Expressions  (16.41)  and  (16.42)  are  valid  for  signals  with  known  amplitude. 
When  signal  amplitude  is  random,  then  these  expressions  will  remain  valid  if 
and  SJF  are  understood  to  be  conditional,  rather  than  unconditional,  mean  squares 
determined  for  a  given  amplitude  value  (given  signal-to- noise  ratio  q). 


**See  page  423  footnote. 
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It  follows  from  comparison  of  expressions  (16.41)  and  (16.43)  that,  when 

X  “■  "T  —X 

pt s  0  (or  Pt.  r  s  0),  mean  square  errors  6«  and  6}  during  simultaneous  measurement 
of  two  parameters  are  determined  from  the  same  formulas  as  is  the  case  when  one 
of  these  parameters  is  precisely  known. 

We  will  use  a  signal  having  the  form  of  a  pulse  with  a  bell-shaped  envelope 
as  our  example 


a  (/— t)  =  0,*-*  «'-*>• 

and  frequency  occupation  equalling  f  +  F,  i.  e.,  w(t)  s  0. 

Here,  from  (16.37)  we  obtain 

U(t,  t,  F) 


(16.44) 


/298 


and  signal  function  (T •  V7 )  ,  in  accordance  with  (16.40),  has  the  form 

r* 


V(t'.  F>)=± 


Following  simple  transformations,  we  obtain 

V(t\ 


(16.45) 


where 


z  — 


(16.46) 


Expression  (16.46)  may  be  considered  a  Fourier  transform  from  integrand 


and  may  be  found  using  conventional  methods  of  operational  calculus  or  from  tables. 
Then,  we  will  obtain 


and,  in  accordance  with  (16.45) 

V(t',  j//rZe"£V,\ 

But,  it  follows  from  (16.39)  and  (16.44)  that 

Substituting  this  expression  into  (16.47),  we  have 

-—»,*)»  (F*)» 

T(t’.  f')»e  *  e  * 

As  could  be  expected,  the  result  is 

V(0.  0)-l, 

and,  with  a  /£*1  and  F'  increase,  magnitude  4^  decreases. 

Considering  (16.49),  formulas  (16.42)  provide 

0*  =c;  0^»»  — ; 

C 


Substituting  these  results  into  formulas  (16.41),  we  have 


2 

8 

ft.* 


j_ . 

2*c’ 

c 

2$«* 

0. 


(16.47) 

(16.48) 

(16.49) 
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Since,  the  result  is  Pt.r  -  q  j.n  this  case,  then  mean  square  errors  SP  and 


Sp  are  determined  in  the  identical  manner  as  in  the  case  of  estimation  of  one 
of  these  parameters  when  the  second  parameter  is  precisely  known. + 

It  follows  from  formula  (16.50)  that,  in  this  case,  mean-square  errors 

and  $7*  will  depend  only  on  signal- to-noise  power  ratio  q  and  on  parameter 
c,  which  characterizes  pulse  duration  [see  expression  (16.44)]  and,  consequently, 
the  width  of  its  spectrum:  the  greater  the  c,  the  shorter  the  pulse  duration 
and  the  broader  its  spectrum.  Therefore,  it  is  quite  understandable  that,  with 
an  increase  in  c,  the  mean  square  lag  measurement  error  decreases,  while  the 
mean-square  frequency  shift  F  measurement  error  increases. 

The  reader  will  find  the  principles  of  constructing  an  optimum  receiver  for 
simultaneous  measurement  of  two  parameters,  as  well  as  additional  problems  involving 
optimum  estimation  of  signal  parameters,  in  Fal'kovich's  book  [5].  We  will  dwell 
briefly  only  on  one  of  the  most- important  problems — the  influence  of  signal  shape 
during  simultaneous  measurement  of  two  parameters.  Here,  for  specificity,  we 
will  introduce  an  examination  applicable  to  radar,  i.  e. ,  we  will  assume  that 
lag  'C  is  a  measure  of  an  object's  (aircraft  and  so  on)  range,  while  frequency 
shift  F  is  a  measure  of  this  object's  radial  velocity. 

This  problem  was  examined  in  several  works,  including  those  of  Woodward  [2], 
Siebert  [105],  and  in  the  aforementioned  Fal'kovich  book  [5].  Several  fundamental 
results  of  this  examination  will  be  presented  in  the  next  section. 


However,  it  still  does  not  follow  from  this  that,  when  pt.f  =  0,  the  necessity 
for  simultaneous  parameter  F  measuremept  may  not  in  any  way  influence  the  accuracy 
achieved  in  measurement  of  parameter  u( and  vice  versa). 

Actually,  it  follows  from  formula  (16.50)  that,  for  given  signal-to-noise  ratio  q, 

a  decrease  in  error  /«i  requires  a  decrease  in  the  magnitude  of  coefficient  c 

(i.  e. ,  an  increase  in  pulse  duration).  But,  here,  the  error  V~&\.  magnitude  will 

rise.  Therefore,  if  the  coefficient  c  increase  during  practical  system  accomplishment 

is  restricted  by  this  same  error  /sf  rise  above  the  permissible  level  (and  not 

by  any  other  circumstances,  such  as  design  approaches),  then  the  necessity  for 
sufficiently-accurate  parameter  F  measurement  may  lead  to  an  increase  in  parameter 
T  measurement  error  (and  vice  versa).  Consequently,  when  px.f  -  0,  this  hidden 
influence  of  the  necessity  for  parameter  F  measurement  on  parameter  X  measurement 
accuracy  (and  vice  versa)  may  occur. 


■r.y.  -v  «v-% 


c-j 
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16.6  Impact  of  Signal  Shape  During  Simultaneous  Frequency  and  Lag 
Measurement 


/30Q 


As  follows  from  formulas  (16.41)  and  (16.42),  parameter  T^and  F  measurement 
accuracy  will  depend,  given  signal-to-noise  power  ratio  q,  only  on  the  type  of 
signal  function  ^  (T*.  F')  ,  Therefore,  one  of  the  basic  (if  not  the  basic)  optimi¬ 
zations  of  the  signal  uc(t)  shape  is  the  proper  type  of  signal  function  V  (t';  F') . 

Above  plane  (”£'#  F'),  this  function  forms  a  surface,  whose  vertex  is  located 
at  the  origin  of  the  coordinates  and  which  equals  unity  since  always 


m  0)-l.  (16.51) 

By  virtue  of  the^T'  and  F*  increase,  function  ^ (V.  decreases  monotonically 
or  not  monotonically,  depending  on  signal  uc(t)  type.  In  the  latter  case,  surface 

has,  besides  a  central  vertex,  other  additional  vertices  of  lesser  magnitude. 
However,  for  any  signal  type,  the  complete  volume  formed  by  surface  (*',  F')  with 
plane  ( 0, “XT ’ ,  F ;)  turns  out  to  be  identical  and  equalling  unity,  i.  e.,  always 

]  J  Y(t\  F')dx'dF;  =  l.  (16.52) 

In  radar,  this  relationship  is  called  the  ambiguity  principle.  [The  validity 
of  relationship  (16.52)  is  demonstrated  in  general  form  by  substitution  of  expression 
(16.40)  into  formula  (16.52)  and  performance  of  the  corresponding  mathematical 
transforms].  The  same  procedure  is  used  for  convenience  in  geometric  interpretation 
of  surface  V(t\  F')  and  the  volume  it  forms  as  is  used  to  depict  mountain  peaks 
on  a  map-sections  of  surface  V  (*'.  F')  with  horizontal  planes  are  drawn  at  different 
levels  ("altitudes")  and  sections  corresponding  to  different  levels  are  noted 
on  plane  (0, 'f' ,  F’)»  using  different  densities  of  shading  for  instance;  the  higher 
the  level,  the  denser  the  shading. 

Usually,  three  gradations  of  magnitude  4/  are  used  for  an  approximate  /301 
characterization  of  the  type  of  surface  F')  ,  namely:  solid  sections  of 

plane  (0,  X%  »  F * )  correspond  to  regions  of  high  "correlation,"  i.  e. , 

Y(t',  F')»l  ;  shading  corresponds  to  slight  "correlation,"  i.  e.  to  regions  where 


V* 


1 

0<  ,F(t',  l  }  finally,  light  sections  of  plane  (O,^',  F')  apply  to  the  "uncor¬ 
related"  region  where  V  (t\  F')  —  0. 

Formula  (16.49)  describes  the  signal  function  for  a  signal  in  the  form  of 
a  sinusoidal  pulse  with  a  bell-shaped  envelope  and  surface  section  V  (t\  F')  of 


n 


< 


a  horizontal  plane  has  the  form  of  an  ellipse  with  semi-axis  ratio 


b  c 

a  n 


(16.53) 


Therefore,  Figure  16.3  characterizes  the  type  of  function  V(i\  F')  for  such  a 
signal. 


& 


Figure  16.4  corresponds  to  a  sinusoidal  pulse  with  a  rectangular  envelope 
with  duration  t„  .  ’ 

A  train  of  four  such  coherent  pulses  (Figure  16.5a)  corresponds  to  Figure 
16.5b.  Evidently,  surface  ¥(t \  F'),  having  one  vertex  and  no  region  of  zero 
"correlation"  [since,  in  this  case,  ¥(t',  F')  equals  zero  only  when 
or  F’  -*■  oo|.  corresponds  to  Figure  16,3.  A  surface  with  one  vertex  having  a 
region  both  with  large  and  small,  as  well  as  with  zero,  "correlation"  (the  latter 
occurs  when  K  1>T»  )  corresponds  to  Figure  16.4. 

Finally,  a  surface  with  a  large  number  of  vertices  and  with  regions  /302 
with  a  large,  small,  and  zero  "correlations"  corresponds  to  Figure  16.5.  Appropriate 
figures  may  be  drawn  for  other  signal  types  as  well. 
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Figure  16.5 


When  solving  the  problem  of  which  type  of  function  ^  (T*.  F*)  [and,  consequently 
of  signal  uc(t)]  is  the  best,  one  must  consider  a  series  of  circumstances,  with 
the  following  being  the  most  important: 

1.  The  requirement  for  high  parameter  t  and  F  measurement  accuracy  and  the 
absence  of  measurement  ambiguity. 

2.  The  requirement  for  hi#  resolution. 

3.  Possible  simplification  of  the  measurement  system  (racbr). 

It  is  easy  to  become  convinced  that  the  following  conditions  must  be  met 
in  or  tr  to  increase  parameter  Tand  F  measurement  accuracy  and  eliminate  function 
▼(*'.  F')  ambiguity: 

a)  Only  one  (central)  region  of  high  "correlation"  must  exist  within  a  given 
range  of  possible  measured  magnitude  ( 7*  and  F)  values. 

b)  This  region  must  be  as  small  as  possible. 

The  first  of  these  conditions  is  required  to  eliminate  measurement  ambiguity, 


while  the  second  is  needed  to  increase  measurement  accuracy.  The  following  compu¬ 
tations  confirm  the  validity  of  these  postulations. 

Insuring  high  parameter  X and  F  measurement  accuracy  in  the  presence  of  noise 
requires  that  even  very  slight  changes  in  these  parameters  cause  significant  changes 
in  signal  u  (t,Xt  F)  shape  since,  otherwise,  the  probability  is  great  that  parameter 

v 

changes  will  not  be  detected  due  to  the  changing  noise  action.  But,  it  is  evident 
from  relationships  (16.39a)  and  (16.40)  that,  given  absolutely  no  parameter  changes 
(where  ^  ^  and  F^  =  F^,  i.  e.,  when*?'  =  0  and  F'  s  0),  the  result  is 

V(t\ 

Consequently,  zero  or  infinitely-slight  parameter  changes,  which  in  principle 
are  impossible  to  detect  even  given  the  slightest  possible  noise,  correspond  to 

values  equalling  or  infinitely-close  to  unity.  Therefore,  the  region  of  function 
V  (x't  F')  values  in  a  range  where  V  (t',  F) tte  1,  i.  e. ,  a  region  of  high  "correlation," 
is  called  the  ambiguity  region. 

The  smaller  the  ambiguity  region,  i.  e.,  the  region  of  high  "correlation" 
turns  out  to  be,  the  higher  the  parameter  X and  F  measurement  accuracy. 

Hence,  it  also  follows  that,  if  not  one,  but  several  regions  of  high  "correla¬ 
tion,"  i.  e.,  ambiguity  regions,  fall  within  the  possible  measured  parameter  values, 
then  there  is  ambiguity  in  determination  of  X  and  F  magnitude.  Thus,  for  example, 
in  the  case  of  a  pulse  train  (Figure  16.5),  measurement  ambiguity  arises  if  T* 
or  F‘  depart  bounds  ±(TB  —  r9)  and  ±  (fj  —  jf)  ,  respectively. 

We  now  will  explain  how  function  V(t',  F')  type  [i.  e.,  signal  u  (t)  /3Q3 

c 

type]  influences  measurement  system  (radar)  resolution. 

First,  we  will  examine  the  simplest  case  when  there  is  a  total  of  two  objects 
in  space — a  given  object  with  parameters  ( X ^  and  F^)  and  interfering  object  with 
parameters  ( X^  and  F^).  We  will  assume  that  signals  uQ(t, '7^,  F^)  and 
uc(t,  X^t  Fg)  reflected  from  both  objects  differ  only  in  the  values  of  parameters 
Tand  F.  Then,  it  again  is  possible  to  form  a  (16.40)-type  "correlation"  function 
for  these  signals  and  the  figures  plotted  on  its  basis,  such  as  Figures  16.3, 

16.4,  and  16.5,  tum  out  to  be  valid. 


If  both  objects  have  identical  parameter  ! and  F  values  (i.  e. ,  'X?'  =  0  and 
F'  s  0),  then  they  in  principle  are  indistinguishable.  Here,  as  follows  from 
(16.40),  the  result  is  VO*',  F‘)  =  1. 

Consequently,  if  function  V(r\F’)  equals  or  is  infinitely-close  to  unity, 
then  both  objects  are  indistinguishable,  regardless  how  slight  the  noise  is. 

Hence,  it  follows  that  regions  of  high  "correlation"  [where  Hare 

ambiguity  regions,  not  only  from  the  point  of  view  of  the  impossibility  of  accurate 
measurement  of  the  parameters  of  a  single  object,  but  from  the  point  of  view  of 
the  impossibility  (in  this  region  of  parameters)  of  distinguishing  one  abject 
from  another,  i.  e. ,  from  the  point  of  view  of  system  resolution.  Therefore, 
it  also  is  desirable  from  the  resolution  standpoint  that  the  region  of  "correlation" 
be  as  narrow  as  possible. 

The  lower  the  function  Y(t',  F’)  value  for  signals  from  two  objects,  the  easier 
it  is  to  distinguish  them  from  each  other  in  the  presence  of  noise  and,  consequently 
the  higher  the  system  resolution. 

As  pointed  out  above, 7 ■»  Ois  the  case  for  orthogonal  signals.  Therefore, 
the  solution  turns  out  to  be  complete  for  such  signals — it  is  possible  completely 
to  avoid  the  interference  from  signal  uc(t,  T^,  F2)  when  signal  uc(t,^,  F^) 
is  being  received  (if  orthogonality  is  achieved  by  the  signals  not  overlapping 
over  time  or  with  respect  to  frequency  spectrum,  then  such  a  complete  solution 
is  achieved  through  time  or  frequency  gating,  respectively). 

When,  instead  of  one,  there  are  several  interfering  signals,  the  problem 
of  signal  shape  [or  of  function  ▼(•»',  F')  type]  influence  on  system  resolution 


Figure  16.6 


is  complicated  greatly.  In  particular,  function  V(t',  f')  type  in  regions  of  slight 
"correlation"  (in  the  shaded  regions  of  Figures  16.3 — 16.5)  acquires  great  signi¬ 
ficance.  However,  in  this  case  also,  the  ideal  signal  would  be  the  one  (Figure 
16.6)  in  which  the  region  of  high  "correlation"  would  be  as  small  as  possible, 
while  there  would  be  no  region  of  slight  "correlation"  (in  Figure  16.6,  Tn  denotes 
signal  duration,  while /7C  —  width  of  its  spectrum;  nexa  >  |  ).  Evidently, 
such  a  signal  is  ideal  also  from  the  standpoint  of  greatest  measurement  accuracy 
and  absence  of  ambiguity.  However,  obtaining  the  ideal  signal  encounters  serious 
principle  and  practical  difficulties. 

The  principle  difference  involves  relationship  (16.52)  presented  above,  /304 
i.  e.,  the  "ambiguity  principle,"  from  which  it  follows  that  the  complete  volume 
beneath  surface  ’?(*',  F')  always  equals  unity. 

Actually,  obtaining  the  ideal  signal  (Figure  16.6)  means  making  that  part 
of  the  region  beneath  surface  V  (*',  F'),  located  near  the  origin  of  the  coordinates 
as  small  as  possible.  But,  since  the  complete  volume  beneath  surface 
’Pf*'./7')  must  remain  unchanged  (equal  to  unity),  then  an  increase  in  part  of 
the  volume  located  outside  the  central  region  of  high  "correlation"  unavoidably 
increases  here.  This  signifies  that,  when  there  is  a  decrease  in  the  central 
region  of  high  "correlation,"  the  level  of  "correlation"  outside  of  this  region 
must  be  increased  and  additional  regions  of  high  "correlation"  even  may  appear. 

This  may  lead  to  a  deterioration  in  resolution  or  onset  of  ambiguity  (if  the 
additional  regions  of  high  "correlation"  turn  out  to  be  within  the  range  of  possible 
measured  parameter  values).  Practical  problems  of  obtaining  a  signal  that  is 
close  to  ideal  involve  equipment  complexity. 

It  is  pointed  out  in  [103]  that  noise-like  signals  or  signals  in  the  form 
of  a  coded  pulse  train  fall  in  the  category  of  signals  with  a  function  'P(t',F'), 
close  to  ideal.  Here,  a  coded  signal  is  obtained,  for  example,  from  an  initial 
pulse  of  duration  t„  with  sinusoidal  occupation,  which  will  be  divided  into  /7ct„ 
intervals,  each  with  duration  ^  (here,  Ta  —  duration  of  the  entire  signal, 
while  I7e  —  width  of  its  spectrum).  Then,  rf  oscillations  during  transition  from 
one  given  interval  to  the  next  are  retained  unchanged  or  are  rotated  in  phase 
180°-- in  accordance  with  whether  the  corresponding  code  character  in  binary  code 
comprising  ner9  code  characters  equals  zero  or  unity.  The  selected  code  is  such 


that  the  "correlation"  function  minimum  is  insured  for  allT*'  an d  F* 

values  that  differ  from  zero. 

In  conclusion,  the  following,  very  important,  circumstance  that  has  not  had 
suffcient  attention  devoted  to  it  in  the  literature  should  be  underscored.  All 
conclusions  made  above  concerning  the  influence  of  the  type  of  signal  function 
(and,  consequently,  the  signal  type  as  well)  on  parameter  'Fand  F 
measurement  sccuracy  completely  are  valid  only  given  sufficiently-high  signal- to-noise 
ratio  q. 

This  is  because,  first,  only  in  this  case  are  initial  formulas  (16.41)  and 
(16.42)  valid.  Therefore,  only  in  the  event  of  a  hig^  signal- to-noise  ratio  is 
it  possible  to  affirm  that,  for  a  given  signal- to- noise  power  ratio,  parameter  T 
and  F  measurement  accuracy  completely  ia  determined  by  the  type  of  signal  /3Q5 
function  V  (t\  f). 

Second,  it  is  possible  from  the  following  computations  to  become  convinced 
that  the  above  conclusions  may  be  completely  invalid  for  a  low  signal-to-noise 
ratio. 

It  was  noted  above  that,  in  the  case  of  noise-like  and  coded  signals,  it 
is  possible  from  the  measurement  accuracy  and  ambiguity  standpoint  to  come  close 
to  the  ideal  caae  depicted  in  Figure  16.6.  But,  it  follows  from  this  figure  that, 
when  nfm  ,  the  region  of  hi<Ji  "correlation"  will  strive  towards  zero. 

This  signifies  that  it  is  possible,  using  an  unlimited  increase  in  the  nueber 
of  signal  degrees  of  freedom,  to  make  a  parameter  T'and  F  measurement  error 
as  sli^it  as  desired,  even  while  keeping  signal-to-noise  power  ratio  q  unchanged 
and  finite.  Therefore,  if  you  assume  that  this  result  is  valid  for  any  magnitude 
q,  then  this  will  denote  that  it  is  possible  by  means  of  an  unlimited  increase 
in  T*  to  bring  the  measurement  error  to  zero,  even  for  a  very  low  (but  finite) 
signal-to-noise  ratio. 

But,  this  result  contradicts  fact  since  it  is  impossible,  given  low  signal- 
to-noise  ratio  q,  regardless  of  the  signal  u£(t)  shape,  to  solve  even  the  signal/no 
signal  at  receiver  input  problem  reliably  enough  and,  consequently,  it  is  impossible 
even  to  guarantee  that  l^and  F  measurement  results  will  apply  to  a  usable  signal 
and  not  to  noise,  i.  e.,  regardless  of  the  degree  to  which  they  are  usable. 
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Actually,  solution  of  the  signal/no  signal  at  input  problem  is  nothing  but 
binary  signal  detection.  It  was  demonstrated  in  Chapters  S  and  9  that,  for  simple 
binary  signal  detection,  detection  error  probabilities  will  depend  exclusively 
on  the  signal-to-noise  power  ratio  [see,  formulas  (5.29),  (9.49),  and  (9.83,  for 
example]  and,  when  the  value  of  this  ratio  is  low,  reliable  detection  is  impossible, 
regardless  of  signal  shape.  In  the  example  of  parameter  T'and  F  measurement  examined, 
we  are  dealing  with  complex,  not  simple,  binary  detection  (since  parameters  'Z' 
and/or  F  are  unknown  during  signal  detection).  But,  it  is  evident  that,  if  the 
given  signal-to-noise  power  ratio  is  insufficient  for  reliable  simple  detection, 
then  it  is  all  the  more  insufficient  for  the  same  level  of  complex  detection  reli¬ 
ability.  Therefore,  it  is  evident  that,  when  q  is  low,  achievement  of  high  detection 
reliability  and/or  high  parameter  iTand  F  measurement  accuracy  in  principle  is 
impossible,  regardless  of  signal  shape. 

It  fallows  from  what  has  been  stated  that  signal  shape  complexity  (increasing 
its  number  of  degrees  of  freedom  )  for  the  purpose  of  decreasing  the  region 
of  high  "correlation"  (Figure  16.6)  and  increasing  measurement  accuracy  actually 
may  provide  an  increase  in  accuracy  only  when  signal-to-noise  ratio  q  is  sufficiently 
hicji.  Here,  the  greater  the  number  of  degrees  of  freedom  /7„T«  ,  the  greater  magnitude 
q  must  be  in  order  for  the  results  obtained  to  be  valid. 

Here,  we  have  an  analogy  with  a  simpler  case— measurement  of  one  parameter 
(x),  examined  in  Chapter  6.  Here,  it  was  demonstrated  from  the  analytical  results 
Kotel'nikav  obtained  and  geometric  interpretation  that  measurement  accuracy  /306 
increases  without  restraint  when  the  number  of  signal  degrees  of  freedom  increases, 
but  signal-to-noise  ratio  q  required  for  realization  of  this  accuracy  rises  simul¬ 
taneously— otherwise,  anomalous  errors  unavoidably  arise  [a  skip  from  one  multi¬ 
dimensional  spiral  winding  (Figure  6.8)  to  another]  and,  consequently,  accuracy 
deteriorates  sharply. 

The  problem  of  the  impact  of  signal  shape  on  radio  reception  noise  immunity 
is  examined  in  some  detail  in  many  works  [5,  127—129,  135,  189—191,  and  others]. 
However,  the  examination  considers  anomalous  errors  only  in  several  of  them  [190, 

191,  and  others]. 
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GENERALIZED  OPTIMIZATIONS  (USE  OF  THE  THEORY  OF  STATISTICAL  DECISIONS) 

17.1  Problem  Formulation 

t  The  examination  in  preceding  chapters  applied  to  more-widespread,  but  particular, 

receiver  optimizatons — to  criteria  of  minimum  composite  error  probability,  minimum 
root -mean- square  error,  and  others.  However,  in  several  cases,  selection  of  a 
specific  optimization  type  turns  out  to  be  difficult.  Therefore,  it  is  very  important 
to  know  how  the  selected  optimization  type  impacts  on  optimum  receiver  structure 
I  and  properties.  This  means  that  it  is  necessary  to  introduce  generalized  optimi¬ 

zations,  which  would  encompass  complete  classes  of  particulsr  criteria.  This 
approach  turns  out  to  be  possible  when  the  theory  of  statistical  decisions  is 
|  used. 

The  theory  of  statistical  decisions,  the  foundations  for  which  were  laid 
by  Wald  (30,  16,  118],  is  a  development  and  generalization  of  methods  of  testing 
j  statistical  hypotheses  and  estimating  distribution  parameters  and,  to  a  significant 

|  degree,  uses  results  from  game  theory.  Therefore,  such  terminology  characteristic 

|  of  game  theory  as  "decision, "  "loss"  (or  "deficit"),  "risk,"  and  so  on  are  used, 

in  particular,  in  the  theory  of  statistical  decisions. 

i 

t 


In  accordance  with  this  theory,  the  problem  of  optimum  reception  of  a  signal 
|  on  a  noise  background  is  formulated  in  the  following  way. 
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j 

Mixture  (not  mandatorily  additive)  y(t)  of  signal  u  (t)  and  noise«m  (0.  »rrives 

C 

|  at  receiver  input  (Figure  17.1),  i.  e., 

y(/)=«c(/>®  «„,(/). 

i  Here,  the  character  <3  denotes  “mixture." 


ut(t! 


(*0 
y(tj\  ftVUtMHUK 

A(r\y) 


Figure  17.1.  (a)  —  Receiver. 

In  a  particular  case,  given  additive  signal  and  noise,  this  mixture  is  /307 
a  sum,  i.  e.. 


#(0-M0+«m<0. 


Noise  statistical  characteristics  are  assumed  to  be  precisely  known,  i.  e., 
the  r>- dimensional  law  of  noise  distribution  Wm  (««>.  is  known.  The  signal 
may  be  represented  in  the  following  form: 

MO-M*;  x ;  o, . oj,  (17.1) 

where  x,  at,  ....  am  —  signal  parameters  unknown  at  the  point  of  reception.  These 
parameters  during  time  of  observation  (0,  T)  may  be  unknown  constant  magnitude 
or  unknown  time  functions  x(t)j  W»  •••* 

Here,  x  is  a  usable  message  subject  to  reproduction,  while  au  ....  <*m  —  parasitic 
parameters,  i.  e.,  parameters  containing  no  information  concerning  the  reproduced 
message. 

Unknown  magnitudes  (or  time  functions)  x^,  «i.  ....  am  at  the  point  of  reception 
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are  considered  random  magnitudes  (or  random  time  functions)  having  known  a  priori 
distributions  P(x)  and  'P  (alt  ....  om). 

Since  parasitic  parameters  ‘a,,  ....  am  do  not  comprise  any  information  about 
message  x,  there  is  no  statistical  link  between  these  parameters  and  message  x. 

It  is  assumed  that  signal  u  (t)  dependence  on  time  t  and  parameters  (x;  o„  ....  am) 
at  the  point  of  reception  are  precisely  known;  consequently,  given  known  parameters 

>  <*i . •  am)  ,  signal  u  (t)  is  precisely  known.  This  signal  is  called  a  signal 

with  unknown  or  random  parameters.  The  theory  presented  below  also  is  valid  for 
completely-random  signals  (a  priori  n- dimensional  distributions  known  at  the  point 
of  reception)  [120].  However,  since  signals  with  unknown  parameters  are  more 
typical  in  radio  engineering,  further  exposition  applies  to  a  (17.1)-type  signal. 

Since  signal  and  noise  statistical  characteristics  are  assumed  to  be  precisely 
known,  the  following  joint  distribution  is  known  or  may  be  ac  rately  computed 

P{x,y)=P(x)P,(9)>  (17.2) 

This  distribution  more  simply  will  be  found,  given  additive  signal  and  noise. 
In  this  case 


?(*)-««(*;  x;  a, . (17.3) 

therefore 


Ps.m, . •-(P)='^B(y—«e) 


(17.4) 

/308 


P*(y) 


i  "i . 


. i  -dam. 


(17.5) 


Consequently, 


J  •••  f  ^■(ll“Bc)^(*ii  •••)  am)  dat ...  dam, 

^  (17.6) 
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where 


ue»ue(/;  x\  a, . am). 


Since  distributions  (uB)and  P  (oj,  ....  am)  are  assumed  to  be  known,  then 
formula  (17.6)  makes  it  possible  to  determine  distribution  Px(y).  Following  this, 
joint  distribution  P(x,  y)  may  be  found  from  formula  (17.2). 

So,  signal-plus-noise  y(t),  whose  statistical  characteristics  are  precisely 
known,  arrives  at  receiver  input  (Figure  17.1).  The  receiving  device  analyzes 
oscillation  y(t)  during  time  of  observation  (0,  T)  and,  based  upon  this  analysis, 
supplies  decision  7  at  receiver  output.  The  operations  performed  in  the  receiver 
with  oscillation  y(t)  to  form  decision  “If  are  called  a  decision  rule  and  are  designated 
A(V  \U)-  .  The  type  of  decision  If  will  depend  an  receiver  purpose. 


If  the  task  is  just  binary  signal  detection,  then  7  has  a  total  of  two  discrete 
values  70  ("no  signal")  and  7^  ("signal"). 


For  simple  reproduction  of  discrete  messages  x^,  .  .  .,  xm,  decision  7has 
corresponding  discrete  values  7^»  •  •  •»  m»  For  simple  reproduction  of  analog 
message  x  or  x(t),  decision  7i*  analog  megiitude  7 or  time  function  7(o.  res¬ 
pectively.  Here,  in  a  case  of  precise  reproduction 


Y-*.  or  T(0 -*(<)• 

In  several  cases,  the  requirement  is  not  simple  message  reproduction,  but 
rather  reproduction  with  performance  of  some  additional  operations  such  as  ampli¬ 
fication,  differentiation,  integration,  prediction  of  the  future  law  of  message 
change,  and  so  on.  Thus,  for  example,  the  requirement  in  ra<far  often  is  to  use 
observation  of  target  trajectory  over  given  time  interval  (0,  T)  as  the  basis 
for  a  prediction  of  a  type  of  trajectory  outside  the  bounds  of  this  interval  (for 
proper  antiaircraft  shell  direction). 
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The  following  condition  must  be  met  in  these  cases  when  there  is  no  noise 

y(t)=Dx(t), 

where  0  —  some  operator  corresponding  to  the  requisite  transform  (differentiation, 
prediction,  and  so  on). 

Decision  rule  A  (y|j/)  may  be  regular  (non- randomized)  or  statistical  (randomized! 

A  regular*  rule  is  one  in  which  precisely-defined  (i.  e. ,  with  probability  /309 
equalling  unity)  decision  7  c  or  'Y { t )]  corresponds  to  each  signal-plus-noise  reali¬ 
zation  y(t). 

A  statistical  (irregular)  rule  is  one  in  which  decision  Y Is  linked  with  y 
by  a  statistical,  rather  than  regular,  dependency,  i.  e. ,  for  a  given  y,  the  known 
is  not  decision  Y »  which  the  receiver  will  make,  but  only  the  probability  of 
that  decision. 

All  contemporary  receiving  devices  operate  in  accordance  with  a  regular  law 
(only  equipment  instability  and  malfunctions  cause  deviations  from  regularity). 
However,  if  a  special  set  of  statistical  mechanisms  are  installed  in  a  receiver, 
it  is  possible  to  force  it  to  operate  with  respect  to  a  statistical  law. 

These  same  statistical  decision  rules  are  inherent  in  all  living  organisms, 
including  a  human.  The  reaction  of  a  given  person  to  a  given  specific  action 
of  the  external  environment  may  not  be  predicted  with  complete  validity — only 
the  probabilities  of  certain  reactions  to  a  given  action  may  be  known  beforehand. 
Presence  of  such  irregularity  in  the  conduct  of  living  organisms  turns  out  to 
be  harmful  for  these  organisms  in  some  cases  and  a  useful  factor  in  others.  It 
is  useful,  in  particular,  in  that  it  hampers  its  enemies  struggling  with  this 
organism. 

Correspondingly,  in  machinery  as  well,  particularly  in  receiving  devices, 


"Here  and  in  future,  the  terms  regular  and  irregular  are  synonyms  for  the  terms 
determinate  and  indeterminate,  respectively. 
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irregular  operation  in  many  cases  will  hinder  results,  but  may  tum  out  to  be 
useful  under  certain  complex  conditions.  Thus,  for  example,  irregular  operation 
of  racfar  receivers  designed  for  employment  against  enemy  aircraft  may  hinder  the 
enemy  in  creating  effectively-organized  electronic  countermeasures. 

Thus,  in  a  majority  of  cases,  it  is  advisible  to  design  receivers  with  regular 
decision  rules  but,  in  some  special  cases,  it  may  tum  out  to  be  more  useful  to 
have  receivers  with  statistical  decision  rules. 

If  a  decision  rule  is  statistical,  then  A(yI^)  denotes  the  probability  that 
the  receiver  will  make  decision  7  when  realization  y(t)  is  present  at  its  input. 

If  7is  an  analog  majiitude,  then  A  (vU/)  is  not  a  probability,  but  a  decision  7 
probability  density.  If  If  a  time  function,  then  A(y|y)  is  a  multidimensional 
(n- dimensional)  probability  density. 

This  regular  dependence  exists  between  realization  y(t)  and  decision  7 » 
given  a  regular  decision  rule 

y-r(y);  (17.7) 

therefore,  expression  (17.7)  expresses  a  regular  decision  rule. 

If,  in  the  case  of  a  regular  decision  rule  also,  A  (yijf)  is  understood  for 
the  purposes  of  standardization  to  be  the  probability  density  of  decision  7 /310 
for  a  given  y,  then  it  should  be  assumed  that 

A  (V I  y)  —  ®  If —  r  (y)|,  (17.8) 

where  6  (z)  is  the  delta- function  of  z. 

Thus,  for  all  decision  rules  (regular  and  irregular),  it  is  possible  to 
understand  decision  rule  A<Yiif)  to  be  the  probability  (or  probability  density) 
that  decision  7  will  be  made  for  a  given  y.  Here,  in  the  case  of  regular  decision 
rules,  tyviy)  is  a  (17.8)-type  delta-function. 

Thus,  based  on  analysis  of  realization  y(t),  in  accordance  with  rule  A(vly)  , 
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a  receiver  processes  decision  f .  Evidently,  decision  rule  A(v|#)  is  the  optimum 
rule  (i.  e. ,  that  receiver  structure)  in  which  decision  f  turns  out  to  be  best 
in  some  sense.  Therefore,  the  quantitative  characteristic  of  decision  If  quality 
should  be  selected  in  order  to  find  the  optimum  decision  rule. 

A  loss  function  ("deficit"  function)  is  that  characteristic  /  (■*,  Y).  designating 
losses  ("deficits")  corresponding  to  each  combination  of  message  x  and  decision 
If  taken.  A  quadratic  loss  function  of  the  following  type  is  the  most  widespread 
during  simple  reproduction 


!(*<  ?)  =  (?—*)*• 


(17.9) 


In  the  general  case,  function  /(*,  y)  may  have  the  most-varied  type  depending 
on  receiver  purpose  (detection,  simple  reproduction,  prediction,  and  so  forth) 
and  the  requirements  levied  on  it.  However,  in  all  cases,  it  must  characterize 
losses  linked  with  a  given  message  x  and  decision  ^combination,  i.  e. ,  the  less 
favorable  (from  the  receiver  purpose  standpoint)  given  combination  (x,  f  )  is, 
the  greater  the  magnitude  /(•«.  Y)-  corresponding  to  it  must  be. 

In  a  case  of  simple  reproduction,  an  increase  in  the  reproduction  error  modulus, 
|y  —  x\,  manifests  itself  in  deterioration  of  decision  quality.  Therefore,  during 
simple  reproduction,  a  (17.9)-type  quadratic  function  is  one  of  the  most  suitable. 

In  several  cases,  a  loss  function  may  depend  not  only  on  current  x  and  f 
values,  but  also  on  all  values  of  x  and  If  in  a  certain  region  of  change  of  their 
arguments.  In  this  case,  the  loss  function  is  converted  to  a  functional. 

Finally,  for  some  loss  function  ./(jc,  y)  tyPes>  turns  out  to  be  dependent 
also  on  decision  rule  Afrl#)- 

For  example,  let  the  following  function  be  the  loss  function: 


(17.10) 


where  P{x\y)  —  probability  (or  probability  density)  x  for  a  given  f .  The  lower 
probability  P(x Jy),  ,  the  greater  the  ambiguity  x  for  a  given  f .  Therefore,  in 


information  theory,  expression  (17.10)  is  called  ambiguity  x  for  a  given  'J '.  /311 
Evidently,  the  greater  this  ambiguity,  the  worse  things  are;  therefore,  a  (17.10)- 
type  function  in  principle  is  useful  for  an  estimate  of  receiver  quality  and  may 
be  selected  as  the  loss  function.  Since  probability  x  for  a  given  will  depend 
on  receiver  structure,  i.  e.,  on  decision  rule  A(y|y),  ,  then  a  (17.1Q)-type  loss 
function  also  will  depend  on  decision  rule  A(y|t/).  On  the  other  hand,  a  (17.9)- 
type  function  is  an  example  of  a  loss  function  that  does  not  depend  on  decision 
rule  A(y|#). 


Thus,  loss  function  /(.*,  v)suitable  for  the  operating  conditions  of  a  given 
receiver  is  selected  when  estimating  receiver  quality. 


Since  x  and  ^are  random  magnitudes  (or  random  time  functions),  then  loss 
!(*•  V)  also  is  a  random  magnitude.  Therefore,  decision  quality  in  the  theory  of 
statistical  decisions  is  evaluated  not  by  the  magnitude  of  loss  but  by 

its  expected  value: 


R=l(x,  y), 

where  a  superimposed  line  denotes  statistical  averaging,  i.  e., 

/?=  f  dx  f  I(x,y)P[x,y)dy, 


(17.11) 


(17.12) 


where  and  Aj  —  regions  of  all  possible  x  and  'J values. 

Magnitude  R,  called  the  average  loss  (average  deficit)  is  a  measure  of  decision 
quality  (receiver  quality) — the  smaller  this  magiitude,  the  better  the  decision. 
Therefore,  that  decision  rule  A(y|y),  which  insures  the  minimum  R  magnitude  is  called 
the  best  (optimum)  rule.  Decision  rules  A*  (y|y) providing  the  minimum  R  magnitude 
are  called  Bayes  decision  rules. 

If  loss  function  /  (x,  y)  is  such  that  it  will  not  depend  on  decision  rule 
A'(y|y),  ,  then  magnitude  ^x>  ^  is  called  risk,  while  R  is  the  average  risk. 

If  loss  function  I(x,  y)  will  depend  on  decision  rule  A(y|//),  then  finding  the 
R  minimum,  i.  e.,  seeking  the  optimum  decision  rule,  becomes  much  more  difficult. 
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Therefore,  in  a  majority  of  cases,  the  loss  function  is  selected  so  that  it  will 
not  depend  on  the  decison  rule.  In  future,  we  will  limit  ourselves  for  simplicity 
to  just  such  cases  and  accordingly  we  will  call  magnitude  R  the  average  risk. 

Here,  the  optimum  decision  rule  is  the  one  pnviding  minimum  average  risk  A  (y|y) 
i.  e.,  converts  expression  (17.12)  into  a  minimum.  It  follows  from  this  that 
such  a  decision  rule  falls  in  the  Bayes  decision  rule  class.  Seeking  this  rule, 
we  thereby  will  find  a  mathematical  description  of  the  optimum  receiver  structure. 

Characterization  of  message  fidelity  by  one  number— average  risk  R— naturally 
is  not  universal  or  optimum  in  all  respects  and  generally  it  is  impossible  to 
create  a  criterion  that  is  standard  and  optimum  in  all  respects. 

However,  the  minimum  average  risk  criterion  is  significantly  more  general  /312 
than  the  criteria  presented  in  preceding  chapters  and  they  may  be  obtained  from 
it  as  particular  cases. 

17.2  Basic  Relationships  for  the  Minimum  Average  Risk  Criterion 

Expression  (17.12)  for  average  risk  is  valid  for  various  types  of  messages 
x  and  decisons  Y ,  with  the  following  comments: 

1.  In  a  case  of  discrete  x  and  Y  >  the  integrals  in  this  expression  must 

be  replaced  by  sums,  while  P  (x,  y) should  be  understood  to  be  the  composite  probabilit1 
of  magnitudes  x  and  Y . 

2.  In  a  case  of  analog  x  and  Y  t  distribution  P  (*»  V)is  the  x  and  ^composite 
probability  density . 

If  x  and  T'are  random  time  functions,  then  distribution  P  (x,  y)is  n-dimenaional. 

We  will  examine  several  examples  for  illustration. 

For  example,  let  a  message  have  two  possible  values  x  s  xQ  and  x  s  x^  (binary 
transmission)  and  there  be  only  two  possible  decisions  Y  =  Y 0  (message  xQ  is 
at  input)  and  Y  s  7^  (message  x^  is  at  input). 


In  a  case  of  telegraph  radio  communications,  xq  and  should  be  understood 
to  mean  spacing  and  pulsing,  while  decisions  Ya  and  mean  supplying  spacing 
and  pulsing,  respectively,  at  output. 

In  a  case  of  detecting  signal  presence  or  absence,  values  xq  and  x^  correspond 
to  signal  absence  and  presence,  while  and  to  the  decisions  "no"  (no  signal 
at  input)  and  "yes"  (signal  at  input). 

In  this  case,  expression  (17.12)  takes  the  form  of  a  sum: 


R -22f(**Y)  P  («*•?)• 

*  i 


(17.13) 


In  the  case  under  examination,  the  following  are  possible  message  x  and  decision 
T combinations: 


(*#.?•).  <*».Yi).(*i.T«)  an(j  (x„Yi)- 


(17.14) 


Evidently,  the  first  and  fourth  combinations  correspond  to  correct  decisions, 
while  the  second  and  third  correspond  to  incorrect  decisions. 


We  will  designate  losses  corresponding  to  these  combinations: 

I  (*..  Y*) *»c»;  /  (x„  y,)  =  cf ;  / (x„  y.)  =  c,;  /  (x„ y,)  -  ct.  (17.15) 


Then,  in  accordance  with  (17.13),  we  have 


*  =  c,  P (x„  Y.)  +c,P (*„  Y»)  +  P (*v  Yt)  +  ctp (Xv  Yi)- 


(17.16) 


Usually,  zero  losses  are  ascribed  to  correct  decisions,  i.  e. ,  we  assume: 


Ci  =  0;  c4  =  0; 


(17.17) 


R-c%P  (*„  Yi)  +c,P  (*,.  y,). 


(17.18) 
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This  expression  may  be  written  also  in  the  following  form: 


R  -  c,  P  (*,)  P„  (Yi)  + c,  P  (xt)  P„  (v,).  (17.19) 

For  instance,  let's  examine  a  case  of  binary  signal  detection.  Then,  evidently, 

P*.(Yi)=P,»  and  PXl  (Yo)  -  P„,,  (17.20) 

where  P*  T  and  PBp  —  false-alarm  and  signal  miss  probabilities. 

Here,  average  risk  R  equals 

R^^tP  (*»)  PB  * "t*  f*  P  (•*,)  P np.  (17.21) 

This  expression  completely  coincides  with  expression  (14.12)  since  and 
play  the  role  of  weight  factors  a  and  b. 

If  you  assume  that 


C|  ==  C,  SB  1 . 


(17.22) 


then,  the  result  is 

R**  R Rai~i" R (* t) Pjip  ^  Pom*  (17.23) 

i.  e.,  for  losses  c^,  C2,  Cj,  and  c^  selected  in  accordance  with  relationships 
(17.17)  and  (17*22),  average  risk  R  coincides  with  composite  error  probability 
R«m  and,  consequently,  the  minimum  average  risk  criterion  coincides  with  the  minimum 
composite  error  probability  criterion. 

Consequently,  criteria  examined  in  Chapter  14  actually  are  particular  cases 
of  the  minimum  average  risk  criterion. 

We  now  will  examine  an  example  of  simple  reproduction  of  an  analog  message. 
Here,  as  indicated  in  §  17.1,  a  (17.9)-type  quadratic  function  is  one  of  the  most- 
widespread  loss  function  I  (*.  Y)  types. 
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Here,  from  (17.9)  and  (17.12)  we  have 


/?=  /  dx  \  (v — x)'  P  (x,  v)  dy=  (y  — *)*, 


(17.24) 


i.  e. ,  for  a  quadratic  loss  function,  average  risk  R  coincides  with  mean  square 
(7  —  x7  of  the  message  reproduction  error  and,  consequently,  the  minimum  average 
risk  criterion  coincides  with  the  minimum  root -mean- square  error  criterion. 

As  will  be  demonstrated  below,  in  many  essentially-interesting  cases,  the 
minimum  average  risk  criterion  coincides  also  with  the  maximum  inverse  probability 
criterion  (or  maximum  inverse  probability  density  criterion). 

Consequently,  selecting  some  type  of  loss  function  I(x ,  v).  we  may  obtain 
different  optimizations  from  (17.12),  including  all  the  simpler  criteria  examined 
in  preceding  chapters  as  well. 


For  practical  use,  expression  (17.12)  should  be  transformed  so  that  given  /314 
a  priori  probabilities  P(x)  and  Px(y)  and  decision  rule  A  (y|y)  are  included  in 
it  in  an  explicit  form.  We  will  perform  this  transform,  assuming  that  the  decision 
rule  is  regular,  i.  e.,  is  described  by  regular  dependence  (17.7). 


Considering  that 


from  (17.12)  we  obtain 


P(*,Y)  =  P(x)P.(Y), 


R  -  £  dx  P (x)  j  /  (x, 7)  P,(7)<*7-  (17.25) 

A*  Ay 


Having  designated 


j  *(x,  7)  P*  (7)  dy, 
•  Ay 


(17.26) 


we  have 


It  follow 3  from  expressions  (17.26)  and  (17.27)  that  is  a  conditional  risk 
determined  for  a  given  message  x  value,  while  the  average  risk  may  be  obtained 
by  averaging  this  conditional  risk  with  respect  to  all  possible  message  x  values. 

Expression  (17.12)  may  be  transformed  also  in  the  following  manner.  Since 
the  following  regular  dependence  exists  between  y  and 

T-r(y), 


then 


P  (x,  y)dxdr-P(x,  V)  dxdy, 


where 


T-r(y). 


and,'  from  (17.12)  we  have 

f  J  T (.*,//)/ |jr,r(y))dxdjj. 

A-  A„ 


(17.28) 


[Since  integration  bounds  with  respect  to  y  will  not  depend  on  x  and  vice  versa, 
then  any  integration  procedure  may  be  used  with  respect  to  x  and  y  in  expression 
(17.28)]. 


Expression  (17.28)  is  convenient  because  given  a  priori  distribution  P(x,  y) 
and  decision  rule  P(y)  are  included  in  it  in  explicit  form. 

The  optimum  decision  rule  P(y)  is  the  one  in  which  expression  (17.28)  is 
minimal,  i.  e.,  in  which 


R 


P  (x,  y)  I  \x,  T  (y)|  dxdy-  min. 


(17.29) 


Since 


P(x,  y)  =  P(y)l\(x), 


then  it  also  is  possible  to  repreeent  expression  (17.28)  in  the  following  form:  /315 


(17.30) 


where 


R,  =  j  P9(x)l{x,r(y)\dx. 


(17.31) 


It  follows  from  relationships  (17.30)  and  (17.31)  that  Ry  is  a  conditional 
risk  corresponding  to  given  realization  y  and  average  risk  R  may  be  found  by  averaging 
this  conditional  risk  with  respect  to  all  possible  realizations  y. 

Since  P(y)  and  Ry  are  non-negative  functions,  then  it  follows  from  (17.30) 
that  conditional  risk  Ry  must  be  minimal  for  all  possible  realizations  y  in  order, 
to  obtain  minimum  average  risk  R. 

Consequently,  that  decision  rule  Tap  (y)  which  for  any  y  minimizes  conditional 
risk  Ry  magnitude  is  optimum,  i.  e.,  the  one  which  insures  that  this  condition 
is  met 

^PrW/lx.r(y)Jdr«min  for  any  y>  (17.32) 

Correspondingly  optimum  is  that  receiving  device  which,  for  each  given  realizaton 
y(t),  produces  an  output  "decision”  with  respect  to  the  rule 

T  **  rap  (y), 

where  r n9(y)  —  decision  rule  P (y)  corresponding  to  the  minimum 
ditional  risk  Ry. 

We  will  compare  this  decision  rule  with  the  simpler  rule  of  maximum  inverse 
probability  density  Py(x)  examined  in  preceding  chapters.  Here,  for  brevity, 


(17.33) 

(for  any  y)  con- 


! 

| 

we  will  call  the  rule  based  on  maximum  inverse  probability  density  P^(x)  a  simple 
rule,  and  the  rule  minimizing  conditional  risk  Rv  for  all  y  and,  consequently, 
average  risk  R  as  well,  a  generalized  rule. 

i 

When  the  simple  rule  is  used,  the  decision  T'selected  is  that  message 
value  which,  for  any  y,  is  converted  into  the  inverse  probability  density  Py(x) 

|  maximum,  i.  e.,  the  simple  rule  has  the  form: 

?  =  (17.34) 

|  where  xn  —  message  x  value  at  which  this  condition  is  met 

P*(x)  *=  max  for  any  y.  (17.35) 

The  generalized  rule  has  the  (17.33)  form  and  is  converted  to  the  minimum 
(for  any  y)  conditional  risk  Ry,  i.  e.,  will  be  found  from  condition  (17.32). 

It  follows  from  a  comparison  of  expressions  (17.32)  and  (17.35)  that  a  gener¬ 
alized  decision  rule  in  the  general  case  is  more  complex  for  computations  and 
for  practical  realization  since  expression  (17.32)  includes  loss  function  /316 
/lx,  T  ((/)].  as  well  as  inverse  probability  density  Py(x).  However,  introduction 
of  the  loss  function  makes  it  possible  to  consider  the  relative  danger  of  different 
errors  as  decision  If  is  developed. 

Thus,  for  example,  given  simple  reproduction  and  a  loss  function  of  the  type 

l  (x,  v)~(y—x)'. 


expression  (17.32)  takes  the  form 

(*)(*— r(y)l*<bc=min  for  any  y.  (17.36 

Here,  the  greater  the  error  jx  —  yj,  modulus,  the  more  weight  it  has  in  expression 
(17.36).  Therefore,  generalized  decision  rule  r«p(y),  found  from  condition  (17.36) 
considers  that  large  errors  are  more  dangerous  since  this  circumstance  is  not 


*  ajihL  u 
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considered  at  all  in  a  simple  decision  rule.  However,  for  several  «  >48,  results 

provided  by  simple  and  generalized  decision  rules  coincide. 

For  instance,  let  the  loss  function  have  the  following  special  form: 

/(*,y)=c— «(*— v),  (17.36a) 

where  c  —  some  constant,  while  6  (*  —  y)  delta- function.  Substituting  function 

(17.36a),  called  a  simple  loss  function,  into  expression  (17.31),  we  obtain 

Rt  **  C —  P,  (X)  |  rmV(f)' 

Consequently,  the  minimum  (with  respect  to  f )  magiitude  of  conditional  risk  Ry 
and,  consequently,  of  average  risk  R  as  well,  will  occur  if  the  decision  y  =  r (y) 
selected  coincides  with  that  x  value  corresponding  to  the  inverse  probability 
density  Py(x)  maximum. 

Thus,  for  a  simple  loss  function,  a  simple  decision  rule  (based  on  the  maximum 
inverse  probability  density  criterion)  provides  the  same  result  as  a  generalized 
decision  rule  (based  on  the  minimum  average  risk  criterion). 

As  will  be  shown  in  the  next  section,  when  certain  additional  conditions 
often  occurring  in  practical  cases  are  met,  results  obtained  from  simple  and 
generalized  decision  rules  coincide,  not  only  for  a  (17.36a)-type  simple  loss 
function,  but  in  a  broad  class  of  loss  functions  finding  wider  use  as  well. 

17.3  Minimax  Optimization 

The  minimum  average  risk  criterion  is  based  upon  use  of  complete  information 
on  the  laws  of  signal  and  noise  distribution. 

On  the  one  hand,  this  is  an  advantage  of  the  criterion  since  it  makes  it 
possible  to  use  the  maximum- possible  information  about  the  signal  and  noise  /317 
when  reproducing  the  message.  However,  this  very  special  feature  of  the  criterion 
causes  difficulties  in  those  cases  when,  according  to  problem  conditions,  signal 
and  noise  distributions  are  not  precisely  known.  A  priori  distributions  of  messages 


P(x)  quite  often  tum  out  to  be  unknown.  In  some  cases,  even  which  magnitude 
(which  time  function)  is  the  needed  parameter— random  or  regular,  may  be  unknown. 
Different  approaches  to  surmounting  this  so-called  "a  priori  difficulty"  are  examined 
in  Chapter  21.  We  will  dwell  now  only  on  the  following  two  methods. 

1.  Uniform  distribution  P(x)  method. 

If  distribution  P(x)  is  completely  unknown  or  it  even  is  unknown  whether 
message  x  is  random,  then  it  is  most  natural  to  assume  in  this  case  that  distribution 
P(x)  is  uniform,  i.  e. ,  to  consider 


P  (*)  -  k. 


(17.37) 


where  k  —  some  constant. 

Stemming  from  distribution  P(x)  selected  in  this  manner,  it  is  possible  further 
to  determine  average  risk  R  in  the  normal  manner,  from  formula  (17.23)  for  example. 
Since  it  is  necessary  when  seeking  the  optimum  decision  rule  to  find  only  minimum 
risk  R,  the  magnitude  of  constant  k  plays  no  role  here  and  it  may  be  random. 

The  drawback  to  this  method  is  that  if,  in  actuality,  distribution  P(x)  will 
tum  out  to  be  very  irregular,  decision  rule  P  (y)  found  may  tum  out  to  be  far 
from  optimum. 

2.  Minimax  optimization  criterion. 

This  criterion  will  reduce  the  maximum- possible  value  of  conditional  risk 
Rx,  rather  than  average  risk  R,  to  the  minimum.* 

Mathematically,  the  minimax  criterion  is  formulated  in  the  following  manner: 

max  R„ (A  J  < max*, (A)  for  all  A,  (17.38) 

where  A  =  P(y)  —  random  decision  rule,  while  A^  =  P^(y)  —  decision  rule  insuring 


•Hence  the  term  "minimax"— "MINImization  of  the  MAXimum." 


minimization  of  the  maximum  conditional  risk  R  value  and  therefore  called  a  minimax 
decision  rule* 

The  symbol  Rx(A)  underscores  that  conditional  risk  will  depend  on  decision 
rule  A. 

It  follows  from  determination  of  the  minimax  criterion  that  it  provides  the 
best  decision  rule  (i.  e.,  best  receiving  device  operating  principle)  for  the 
worst  case.  This  is  the  essence  of  its  advantage,  but  it  is  linked  to  its  short¬ 
coming:  since,  in  actuality,  the  worst  case  may  not  occur  at  all  or  be  of  low 
probability,  then  a  minimax  decision  rule  may  tum  out  (in  all  cases  or  in  a  majority 
of  cases)  to  be  far  from  optimum. 

Wald,  stemming  from  several  sufficiently-general  prerequisites,  obtained  /318 
the  following  important  results  for  the  minimax  criterion  [30,  16,  and  118]: 

1.  A  minimax  decision  rule  exists. 

2.  Any  minimax  decision  rule  A^  is  a  Bayes  decision  rule  determined  for 

some  a  priori  distribution  R^x),  called  the  least-favorable  a  priori  distribution. 

3.  A  least-favorable  a  priori  distribution  exists. 

4.  The  Bayes  decision  rule  A,p  [i.  e. ,  TM  (y)l.  in  which  conditional  risk 
Rx  will  not  depend  on  x  is  a  minimax  decision  rule  (A^). 

5.  The  minimum  (with  respect  to  all  rules  A)  value  of  the  maximum  [with 
respect  to  all  distribution  P(x)  types]  average  risk  corresponds  to  a  minimax 
decision  rule. 

Ue  will  recall  that  the  term  Bayes  is  understood  to  mean  that  decision  rule 
which  minimizes  average  loss,  in  particular,  average  risk  R. 

It  follows  from  the  aforementioned  results,  first,  that  satisfaction  of 
relatively-general  conditions  usually  occurring  in  practice  suffices  for  existence 
of  a  minimax  decision  xule. 


«s 


Second,  minimax  decision  rule  may  be  considered  a  Bayes  decision  rule 
insuring  receipt  of  minimum  average  risk  for  some  least-favorable  a  priori  message 
PM(x)  distribution. 

In  short,  a  minimax  criterion  insures  receipt  of  minimum  average  ri8k  for 
the  least-favorable  a  priori  message  distribution.  Therefore,  if  it  turns  out 
to  be  possible  to  compute  the  least-favorable  a  priori  distribution  P^(x),  then 
the  optimum  decision  rule  will  be  found  next  using  the  conventional  method  examined 
in  the  next  section. 

In  the  general  case,  computation  of  distribution  P^x)  presents  great  diffi¬ 
culties.  However,  in  some  cases,  it  will  be  foirtd  relatively  simply  through  use 
of  result  4  listed  above,  which  means  that  Bayes  decision  rule  I'rpU'),  in  which 
conditional  risk  R^  will  not  depend  on  x,  is  minimax  rule  P^Cy). 

The  methodology  for  finding  distribution  P^tx)  here  may  comprise  the  following: 

1.  We  write  the  expression  for  average  risk  R,  in  the  (17.25)  form,  for 
example: 


/?»  J  dxP(x)  J  l{x,y)P,(y)dy. 


(17.25) 


2.  We  will  find  Bayes  decision  rule  r„  (y),  ,  i.  e. ,  the  decision  rule  in 
which  risk  R  is  minimal.  It  follows  from  (17.25)  that  this  decision  rule  will 
depend  on  the  type  of  a  priori  P(x)  distribution  selected  during  computation  of 
the  expression,  i.  e. , 


r«P(y)=rRP  \y,p(x)\. 


(17.39) 


[During  computation  of  expression  (17.25),  distribution  P(x)  is  written  in  /319 
general  form  since  it  still  is  unknown  to  us]. 

3.  We  substitute  decision  rule  rap  (y)  found  into  an  expression  for  conditional 
risk  Rx,  for  instance  into  expression  (17.26),  i.  e. ,  we  assume 


459 


at- 
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R,“J  Hx,y)Px(y)dy,  | 

where  (17.40) 

T  -  r„p  (y)  =  rMP  [y.  P  (x)|. ) 

Here,  conditional  risk  R  turns  out  to  be  dependent  on  a  priori  distribution 

P(x). 

4.  We  try  to  select  the  distribution  P(x)  type  in  such  a  way  that  conditional 
Rx  determined  from  expression  (17.40)  will  not  depend  on  x.  If  this  is  done 
successfully,  then,  as  follows  from  what  was  presented  above,  the  a  priori  distri¬ 
bution  found  in  this  manner  is  needed  distribution  P^(x)  also. 

5.  Having  substituted  the  distribution  P^Cx)  found  into  expression  (17.39), 
we  will  find  the  minimax  decision  rule: 

rM(P)-r„Iy,P«(x)l.  (17.41) 

Thus,  in  those  cases  when  distribution  P(x)  is  selected  successfully  so  that 
conditional  risk  Rx  determined  from  expression  (17.40)  will  not  depend  on  x,  in 
so  doing  least-favorable  a  priori  distribution  P^Cx)  and  minimax  decision  rule 
r„(y)  also  will  be  found. 

If  it  turns  out  that  conditional  risk  Rx  determined  from  expression  (17.40) 
will  depend  on  x  regardless  of  the  form  of  distribution  P(x),  then  this  still 
does  not  signify  that  least-favorable  distribution  P^x)  does  not  exist.  It  only 
denotes  that  it  is  impossible,  using  the  given  method,  to  find  distribution  PM(x) 
in  the  case  under  examination  and  it  must  be  sought  using  other  methods. 

Using  the  aforementioned  methodology,  Middleton  demonstrated,  in  particular, 
that  least-favorable  distribution  P^x)  is  a  uniform  distribution,  given  independent 
additive  signal  and  noise  and  simple  reproduction  of  the  shape  of  a  signal*  with 
a  quadratic  loss  fur-  [of  the  (17.9)  type][118]. 


♦That  is,  when  x  s  x(t)  s  u  (t)  (See  the  next  section). 

460 


Consequently,  in  this  case  important  in  practice,  the  following  uniform  a 
priori  distribution  is  the  least  favorable 


P(x)  =  const, 


and  minimax  decision  rule  I  ^(y)  coincides  with  Bayes  decision  rule  rMp  (y)  found 
for  the  uniform)  a  priori  message  distribution. 


17.4  Some  General  Results  in  Use  of  the  Theory  of  Statistical  Decisions  for 
Analog  Message  Reception  /320 


When  individual  analog  message  values  are  received,  the  most-widespread  loss 
function  is  a  quadratic  function  of  the  type 


/(*.?)=(*-?)•.  (17.42) 

and  optimum  decision  rule  r«p  ( y )  may  be  found  from  condition  (17.36),  which,  following 
evident  transforms,  will  lead  to  the  following  form: 


i 


f  x'Pt(x)dx-2r(y)  f  xP,(x)dx  + 

'x  A* 

+  r*(i/)  j  P,(x)dx  =  min. 


(17.43) 


Consequently,  the  optimum  value  of  rule  '  (y)  is  the  one  in  which  expression  (17.43) 
is  minimal,  i.  e.,  derivative  dRt/d r  equals  zero: 


-2  J  xP9(x)dx+2T(y)  j  P„(x)dx~0. 


Considering  normality  conditions  (4.7)  and  (17.44),  the  result  is 


(17.44) 


r*p(y)“  J  xP,(x)dx. 


(17.45) 


It  follows  from  (17.45) 
the  conditional  (i.  e., 


that,  in  this  case,  optimum  decision  rule  r»P  (y)  equals 
for  a  given  y)  expected  message  x  value. 


Since  distribution  Py(x) 


in  the  case  of  reception  of  individual  analog  message 
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values  is  a  unidimensional  probability  density,  then  formula  (17.45)  has  a  simple 
geometric  interpretation.  rBp  (y)  equals  the  X-axis  of  the  "center  of  gravity" 
of  the  area  located  under  curve  Py(x).  Consequently,  in  this  case,  the  optimum 
receiver,  during  reception  of  given  realization  y(t),  must  select  as  decision 
the  X-axis  of  the  "center  of  gravity"  of  the  area  located  under  curve  Py(x). 

Here,  it  will  insure  the  minimum  average  risk,  i.  e.,  in  this  case  (for  a  quadratic 
loss  function)  the  minimum  root-mean- square  error. 

Known  (given)  a  priori  distributions  P(x)  and  (y )  should  be  used  for  practical 
realization  of  algorithm  (17.45)  to  express  the  a  posteriori  (inverse)  probability 
density  Py(x)  included  in  the  logarithm.  To  do  so,  we  will  use  relationship  (4.5). 
Substituting  it  into  (17.45),  we  obtain 


f  xP  <x)  P,  (y)  dx 


U  (id-  — 


Pin) 


But 
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P  (y)  *  £  P  (x>  y)dx-  j  P  (x)  p,  (y)  dx. 


therefore,  finally  we  have 


T-r,,p(y) 


]  *P(*)P,iv)d* 

_ 

f  p  (*)  p«  (S')  4* 

*9 


(17.46) 


Since  a  priori  distributions  P(x)  and  PxCy )  are  assumed  known,  expression  (17.46) 
makes  it  possible  in  principle  to  compute  optimum  decision  rule  Tup  (if),  i.  e., 
to  find  the  optimum  receiver  structure. 


h. 


rr> 


Substituting  decision  rule  T  (y)  *•  r„p  (y)  found  into  expression  (17.28)  and 
considering  that  P(x,  y)  s  P(x)Px(y),  it  then  is  possible  also  to  compute  the 
minimum  value  of  the  average  risk  R  =  Rm m  corresponding  to  this  rule. 

We  now  will  examine  reproduction  of  messages  that  are  random  time  functions 
(rather  then  random  magnitudes).  Reproduction  of  messages  that  are  random  time 
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functions  at  the  present  time  is  called  filtration.  Therefore,  in  future,  we 
also  will  use  this  term. 


In  filtration,  it  is  necessary  to  differentiate  between  real-time  and  delay 
message  reproduction.  Figure  17.2  illustrates  the  difference  between  the  two. 

Initially,  we  will  examine  real-time  reproduction.  Here,  for  simplicity, 
we  will  assume  that  the  time  change  is  discrete,  with  digitization  interval  At 
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(Figure  17.2).  Then,  the  complete  duration  of  the  reproduced  message  equals 
T  =  nAt,  while  real  time  t.  s  iAt.  During  real-time  reproduction,  message  x(t) 
must  be  reproduced  without  a  delay  relative  to  real-time  information  y(t)  arriving 
at  receiver  input.  This  signifies  that,  by  moment  t^  =  IAt,  it  is  necessary  to 
supply  decision  If  ^  concerning  the  value  of  message  x^  =  x(t^),  while  by  random 
moment  t^  =  IAt  (i  *  1,  2,  .  .  .,  n),  to  supply  decision  If  ^  concerning  the  value 
of  message  x.  s  x(iAt)  (Figure  17.2c). 


In  delay  reproduction,  a  lag  in  shaping  output  realization  ^  (t)  for  a  time 
eqifil  to  (or  exceeding)  T  is  permissible  (Figure  17. 2d). 

In  delay  reproduction,  high  realization  x(t)  fidelity  may  be  obtained.  Actually, 
in  this  case,  by  the  moment  the  decision  is  made  about  any  message  x(t)  values 
(x  ,  .  .  .,  xif  .  .  .,  xn),  the  receiver  possesses  entire  realization  y(t)  during 
interval  (0-T).  In  a  case  of  real-time  reproduction,  decision  7j_  concerning 
message  x.  value  must  be  taken  when  only  that  part  of  realization  y ( t )  which 
occurred  during  interval  0  —  iAt  is  present.  Therefore,  all  other  conditions 
being  equal,  initial  realization  x(t)  sections  (corresponding  to  slight  values 
of  number  i)  will  be  reproduced  less  accurately  and  only  finite  value  xn  =  x(nAt) 
will  be  reproduced  with  the  same  accuracy.  /322 

However,  in  a  number  of  practical  problems,  a  significant  lag  is  not  permitted; 
in  addition,  in  delay  reproduction,  the  algorithm  of  the  optimum  process  may  be 
complicated. 


In  future,  we  will  limit  ourselves  to  examination  only  of  real-time  reproduction. 
In  this  case,  as  noted  above,  at  each  moment  in  time  t  =  t^,  the  requirement  is 
best  reproduction  of  the  message  xi  =  x(tA)  value  corresponding  to  a  given  moment 
in  time  and,  consequently,  the  loss  function  must  be  written  in  the  following 
form: 


/(*,  ?)  =  /(*„  y,), 

where 

x,-x(i,h  Yi  =  Y(?i)- 


(17.47) 


Here,  in  accordance  with  (17.32),  optimum  decision  rule  rKp  (y)  is  the  one  which 
insures  that  this  condition  is  met 


J  P,  (*i)  t  l*i»  T  (y)J  mfn, 

As 

where 


(17.48) 
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But 


P, (x,)  -  J  ...  J  P„ (*, . x,)dxt... dx,- 1 , 

Axt  A*t- 1 


and  condition  (17.48)  takes  the  following  forms 


j  ...  J  Pt(xit  r(t/)]dxt...dx,  =*min. 


(17.49) 


*1  *i 


In  future,  we  will  limit  ourselves  to  examination  of  a  quadratic  loss  function. 
Then,  in  accordance  with  (17.47),  one  should  assume 

/(*.Y)*»(*i-Ti)' 


and,  from  (17.49),  we  obtain 


/?,=■!  J  (*»••••»  *1)1*1“  r(y)l*dx,  min, 


(17.50) 


l.  e. 


Doing  calculations  analogous  to  those  presented  above  [during  derivation 
of  formula  (17.46)],  we  obtain 


where  y  = 


^  ...J  *i  P  C*i, ... ,  *i)  Px, . x,  (y)  <to, ...  <tx, 

Ti  ”  r*p  (</)  =•  ‘j  *  j  P{Xl . *,)P„ . 

(yp  •  •  •»  y^i  ^  s  2,  .  .  .,  n. 


(17.51) 


Since  n  >1,  then  essentially  it  is  impossible  to  realize  such  an  algorithm 
for  optimum  receiver  action  due  to  its  extreme  complexity.  Algorithms  applicable 
for  practical  use  have  been  obtained  only  in  those  cases  when  it  turned  out  to 
be  possible  to  represent  multidimensional  distributions  P(x^,  .  .  •»  xn)  and 
P  .  w  (y,r  •  •  •»  yj  in  a  simple-enough  form. 

Xlf  e  e  if  1  if 

The  following  are  the  aiost  important  of  these  cases. 


465 


First  case.  Message  x(t)  and  noise  um(t)  are  additive,  independent,  and 
have  a  normal  distribution,  i.  e. , 

+««(/), 

where  x(t)  and  um(0  —  independent  normal  random  processes 

Evidently,  in  this  case,  signal-plus-noise  y ( t)  also  has  normal  distribution. 

For  normal  random  process  u(t),  n-dimensional  distribution  P(u^,  .  .  .,  un>  /324 
may  be  represented  in  the  form 

J"  *  ^  y  1 

P(»i . u„)  =  cexp  1—2  2  (“i  ~  «i)  (“J — “i)  J  -  (17.53) 

where  c  --  constant  determined  from  the  normality  condition  and,  in  the  case  under 
examination,  not  having  a  value  [since  it  cancels  itself  after  the  corresponding 
distributions  are  substituted  into  formula  (17.51)]}  ®  —  determinant  of  the 
type 

(17.54) 

whose  elements  are  values  of  correlation  function 

(u,— u,)(u,— uj); 

here 

u,«a(/A/);  u/««</A/);  ^,,  =  (-1  )‘+/A„, 

Aij  —  minor  of  determinant  ®  relative  to  element  . 

Since  processes  x(t)  and  y(t)  in  the  case  under  examination  have  normal  distri¬ 
butions,  (17.53)-type  expressions  are  valid  for  them. 

Substitution  of  expressions  of  this  type  into  relationship  (17.51)  and  exam¬ 
ination  of  formulas  (17.49)  and  (17.51)  demonstrate  that  formation  of  output  effect 
T(t)  for  given  input  realization  y(t)  requires  that  only  linear  operations  be 


\ 

1  (17.52) 


performed  on  this  realization.  This  will  lead  to  a  conclusion  very  important 
for  practice:  if  the  message  and  noise  are  additive,  independent,  and  have  normal 
distribution,  then,  given  a  quadratic  loss  function,  the  optimum  message  reproduction 
system  is  a  linear  system.  In  other  words,  for  the  aforementioned  conditions, 
introduction  of  any  distortions  into  the  system  may  not  decrease  the  magnitude 
of  the  average  risk  (in  this  case,  the  magnitude  of  the  root-mean-square  message 
reproduction  error). 

As  analysis  demonstrates,  this  result  obtained  above  for  a  quadratic  loss 
function  retains  its  force  also  for  several  other  loss  function  types  and  may 
be  formulated  in  the  following,  more-general,  form:  if  loss  function  /  (x,  y)  will 
depend  only  on  the  real-time  value  of  error  (x — y),  while  signal-plus-noise  has 
the  (17.52)  form,  then  the  optimum  system  (providing  minimum  average  risk)  is 
a  linear  system,  while  the  structure  of  this  system  is  just  like  that  of  a  quadratic 
loss  function.  Only  systematic  bias  (expected  value)  of  the  error  in  this  case 
will  depend  on  the  specific  loss  function  type;  if  function  / (x>  y) is  a  nondecreasing 
function  of  the  error  \*  —  Vt  modulus,  then  systematic  bias  will  not  depend  on  specific 
loss  function  type  either. 

Thus,  the  first  case  is  characterized  by  the  fact  that,  for  a  relatively-broad 
class  of  loss  functions  (optimizations),  optimum  filtration  boils  down  to  /325 
linear  filtration,  examined  in  §  2.2. 

Second  case.  Signal-plus-noise  y(t)  has  the  form 

y(0-Me(<,x)  +«„,«). 

where  uc(t,  x)  —  precisely- known  signal  [with  the  exception  of  message  x  =  x(t), 
which  is  its  parameter];  «™(0-  stationary  normal  white  noise. 

Message  x(t)  —  stationary  random  process  with  power  spectrum  close  in  shape 
to  a  rectangle;  message  fidelity  is  high  (high  signal-to-noise  ratio). 

In  this  formulation,  the  filtration  problem  was  solved  for  the  first  time 
by  Kotel’nikov  in  1947  and  the  basic  results  of  the  solution  were  presented  in 
Chapter  7.  The  second  case  is  more  complex  than  the  first  because  that  reproduced 
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message  x(t)  included  in  signal-plus-noise  y(t)  is  not  additive,  but  is  in  the 
form  of  a  signal  u  (t,  x)  parameter.  Kotel'nikov  for  problem  solution  was  not 

C 

required  to  provide  any  specific  type  of  message  x(t)  distribution  P(x^,  ...» 

xn)  since,  in  accordance  with  his  assumptions,  the  type  of  this  distribution  did 

not  exert  a  major  impact  on  optimum  receiver  structure  and  properties  (the  validity 

of  this  assertion  flows  from  Chapter  7  results  and  examinations  presented  in  Chapter 

19).  Distribution  P  .  (y.,  .  .  .,  y  ),  for  the  assumptions  made  about 

XX}  •  •  • f  xn  i  n 

signal- plus-noise  y(t)  type,  turns  out  to  be  normal. 

Third  case.  Signal-plus-noise  y(t)  has  the  form 

y( 0  =  ue«;*:*i» .  ..  aJ+Mm(0.  (17.55) 

where  x  =  x(t)}  o,  =  o,  (/)  (i  =  1, ....  m)  —  parasitic  signal  parameters. 

Here,  signal-plus-noise  y(t)  type  is  more  complicated  than  in  the  preceding 
cases  and  the  optimum  filtration  system  in  the  general  case  turns  out  to  be  nonlinear. 
Therefore,  the  synthesis  problem  solved  in  this  case  is  called  an  optimum  nonlinear 
filtration  problem. 

The  starting  point  for  solution  usually  is  the  minimum  root-mean- square  error 
criterion,  i.  e.,  relationship  (17.51).  Here,  one  of  the  following  types  of  distri¬ 
bution  law  approximation  is  used: 

1.  Markov  approximation. 

2.  Gauss  approximation. 

R.  L.  Stratonovich  in  1959 — 1960  for  the  first  time  developed  the  Markov 
approximation  method  [138,  154,  155,  156,  and  others]  and  it  was  based  on  the 
assumption  that  message  x(t)  and  signal-plus-noise  y(t)  are  components  of  a 
multidimensional  Markov  process. 

I.  A.  Bol'shakov  and  V.  G.  Repin  in  1961  [156]  proposed  the  Gauss  approximation 
method  and  it  was  developed  in  subsequent  works  [127,  133,  end  others]. 


The  following  chapter  is  devoted  wholly  to  the  theory  of  optimal  nonlinear 
filtration  in  view  of  its  great  importance.  Here,  the  presentation  for  brevity 
only  will  cover  the  Gauss  approximation.  This  is  because,  for  a  Gauss  approximation, 
results  are  simpler  and  clearer.  However,  a  Gauss  approximation  requires  /326 
several  serious  assumptions,  which  in  principle  may  be  avoided  in  the  Markov  approx¬ 
imation.  Therefore,  the  Markov  theory  also  has  great  theoretical  and  practical 
significance.  Its  foundations  are  presented  briefly  in  [133],  while  its  applications 
are  examined  in  several  works  [130,  158,  and  others]. 

To  conclude  this  chapter,  we  will  note  the  following,  mo re- particular  but 
important,  results  obtained  in  the  theory  of  optimum  filtration.  They  will  apply 
to  a  case  when  message  x(t)  and  noise  um  (Q  are  additive  and  independent  and  they 
comprise  the  following: 

1.  If  a  priori  message  distribution  P(x)  is  uniform,  then  Bayes  decision 
rule  r„p  (y)  has  the  so-called  property  of  translation,  i.  e. , 


(y + *-)  =*  r*P  (y)  + 


(17.56) 


where  A  —  random  fixed  magnitude  (or  time  function).  This  denotes,  for  example, 
that,  if  fixed  message  x(t)  is  supplied  to  optimum  filtration  system  input  and 
this  message  changes  by  magnitude  A(t),  then  the  oscillation  at  system  output 
changes  by  the  same  magnitude. 


2.  If  no  restrictions  are  placed  on  distribution  P(x),  then  its  least-favorable 
form,  PM(x),  is  a  uniform  distribution  (this  special  feature  already  was  noted 
in  the  preceding  section). 
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OPTIMUM  NONLINEAR  FILTRATION 

18.1  Basic  Relationships  During  Optimum  Nonlinear  Filtration 

As  noted  in  §  17.4,  decision  rule  (system  operation  algorithm)  (17.51),  which 
insures  minimum  root-mean-square  error  in  simple  message  x(t)  reproduction,  corres¬ 
ponds  to  optimum  linear  filtration.*  In  accordance  with  the  Gauss  approximation 
method,  the  following  basic  assumptions  are  made  to  bring  this  decision  rule  to 
a  form  suitable  for  practical  use: 

1.  Message  x(t)  has  a  normal  law  of  distribution  [with  known  mean  value 
x(t)  and  correlation  function  Rx(t^,  ^2^  3* 

2.  Message  x(t)  changes  slowly  compared  with  additive  noise  “m  (0  and  /327 
with  parasitic  signal  parameters  which  change  over  time,  i.  e.,  these  conditions 
are  met 


‘nop 


at »  Ti 


nop  m* 


•hop 


«»T, 


KOP  Si 


(18.1) 


♦Some  generalization  to  a  case  of  complex  message  reproduction  is  presented 
in  §  18.4. 


where  tkop*>  tmopui  and  Tkop«  --  message  correlation  time,  additive  noise  time,  and 
parasitic  signal  parameter  time,  respectively  (signal  fluctuations). 

3.  Message  x(t)  reproduction  is  very  accurate,  i.  e. ,  mean-square  error 

?(7)  =  Ue(3— Y(*)la  (18.2) 

is  a  very  slight  magnitude. 

4.  Modulation  of  signal  (*;  x\  alf  <xm)  by  message  x(t)  —  direct  (deter¬ 
mination  of  direct  modulation  was  provided  in  §  7.1). 

3.  Statistical  characteristics  of  parasitic  signal  parameters  «i.  ....  amwill 
not  depend  on  time. 

Considering  these  assumptions*,  analysis  of  initial  expression  (17.51)  provides 
the  following  results  after  very  unwieldy  transforms  (see  [127],  for  example): 

1.  Optimum  estimate  f  (t)  of  the  reproduced  message  equals 

« 

ViO™  §  i\(t>r)*(i)eh+xW,  (18.3) 

where  impulse  response  (t,  T )  is  determined  from  integral  equation 

< 

*  5  C  (/.  «)  n  (S,  T)  ds  +  C  (f,  T)  -  Tl  (/,  T).  (18.4) 

t 

Additional  function  C(t,  T  )  included  in  this  expression  is  in  tum  determined 
from  the  following  integral  equation: 

t 

k\C{t,s)Rx{s,x)ds+C(t,  x)~R,(t,r).  (18.5) 

i • 

In  these  formulas,  tQ  —  moment  that  message  x(t)  reproduction  (measurement)  begins, 
while  t  —  actual  moment  in  time. 

*The  validity  of  these  assumptions  is  discussed  in  §  18.6. 


2.  Function  z(t)  and  constant  factor  k  included  in  the  formulas  are  determined 
from  the  following  expressions: 


(18.6) 


(18.7) 


the  superimposed  dotted  line  denotes  averaging  over  time  for  an  infinitely-long  /328 
time  interval). 

Function  Q(y,  x,  t)  will  be  found  from  the  equation 


9 

5  Q(y,  x,  t)di-—\nPg(y), 


(18.8) 


where  P  (y)  —  likelihood  function  for  realizations  y(t)  determined  in  interval 
(t  —  At)  t,  which  may  be  random  within  the  following  range: 


Tirep  in*  T«o>  TM  v 


(18.9) 


i.  e.,  interval  At  must  be  much  greater  than  noise  correlation  intervals 
and  tmp«  ,  but  leas  than  message  correlation  time  [in  via*  of  the  assumptions 
made  above,  simultaneous  satisfaction  of  inequality  (18.9)  is  possible]. 

Function  z(t)  may  also  be  represented  in  the  following  form: 


zW-AMO+Ae^Ol. 


(18.10) 


where,  just  as  before, 


«(f)-x(0— V<0. 


while  Aca  (0  —  stationary  white  noise  with  power  spectrum  (unilateral) 


(18.11) 


*H,#  “  T ' 


(18.12) 


4.  Error  6(t)  has  normal  distribution  with  zero  mean  value  ,  fe  (0  —  01  and 
variance 

°«*W  =  *V)  =C(/,  t),  (18.13) 

where  function  C(t,  t)  is  obtained  from  additional  function  C(t,  V  )  through  re¬ 
placement  of  T  by  t. 

The  equivalent  circuit  depicted  in  Figure  18.1  corresponds  to  relationships 
(18.3)— (18.8) ,  (18.10),  and  (18.11).  In  this  circuit, HHC (discriminator)  denotes 


Figure  18.1.  (a)  —  DIS  [Discriminator]; 

(b)  —  Fx  [Filter]. 

the  element  (in  the  general  case,  nonlinear)  which  computes  function  z(t)  by  pro¬ 
cessing  received  oscillation  y(t)  considering  measured  value  f  it)  \  ^  —  linear 

filter  with  impulse  response  7J(t,  T)  determined  by  equations  (18.4)  and  (18.5). 

It  is  easy  to  become  convinced  that  part  of  the  Figure  18.1  circuit  connected 
between  points  a  and  b  actually  converts  z(t)  into  fit)  in  accordance  with  /329 
equation  (18.3).  As  follows  from  Figure  18.1,  the  discriminator  converts  input 
signal-plus-noise  y(t)  into  z(t).  The  law  of  this  conversion  is  determined  from 
relationships  (18.6)  and  (18.8).  Consequently,  these  relationships  completely 
determine  discriminator  structure. 

It  follows  from  formula  (18.6)  that,  when  z(t)  is  formed  from  y(t),  measured 
signal  value  fit)  must  be  considered.  Consequently,  along  with  y(t),  measured 
message  value  'fit)  must  be  introduced  into  the  discriminator,  i.  e.,  a  feedback 
circuit  must  exist  from  meter  output  to  discriminator,  as  depicted  in  Figure  18.1. 
It  follows  from  expressions  (18.10) — (18.12)  that  discriminator  /INC  supplies  an 


oscillation  proportional  to  the  sum  of  error  signal  €(t)  and  white  noise  with 

spectrum  2/k,  i.  e.,  it  will  reproduce  error  signal  £(t)  with  an  error  characterized 
by  noise  Ae„  (/) . 


It  follows  from  Figure  18.1  and  formula  (18.10)  that  it  is  possible  to  use 
the  equivalent  circuit  depicted  in  Figure  18.2  rather  than  the  Figure  18.1  circuit 


Figure  18.2.  (a)  —  F£  [Filter]. 

when  computing  ^(t).  In  Figure  18.2,  2  —  linear  filter  with  impulse  response 

k7J(t,  T).  Consequently,  filter  2  differ  from  filter  $  (Figure  18.1) 
only  in  that  its  transfer  constant  is  increased  by  a  factor  of  k. 

It  follows  from  Figure  18.1  and  formulas  (18.10)  and  (18.12)  that  the  quality 
of  discriminator  operation  completely  is  characterized  by  single  parameter  k, 
which,  first,  is  the  discriminator  transfer  constant  (rate  of  change)  for  slight 
signal  error  values  and,  second,  determines  on  a  ane-to-an*  basis  noise  Ae„  (/) 
spectral  density  at  its  output. 

It  also  follows  from  examination  of  Figures  18.1  and  18.2  that  the  discriminator 
and  linear  filter  r  ^  in  an  optimum  system  perform  the  following  functions: 

a)  the  discriminator  must  separate  error  signal  £(t)  from  signal- plus-noise 
y(t)  with  minimum  error,  i.  e.,  insure  minimum  noise  spectral  density  ?n.o  at  its 
output ; 

b)  linear  filter  «  2  (and,  therefore,  filter  ^  ^)  must  insure  the  minimum 
mean  square  e7  of  message  x(t)  reproduction  error  in  the  Figure  18.2  circuit, 

i.  e.,  under  conditions  when  the  sum  of  error  signal  6(t)  and  white  noise  Ae„  (/) 
with  spectral  density  equalling  2/k  arrives  at  its  input. 

Independent  discriminator  and  linear  filter  system  design  is  possible.  Actually, 
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the  optimum  discriminator  structure  completely  is  determined  by  expressions  (18.6) 
and  (18.8),  which  will  not  depend  directly  on  filter  parameters.  It  is  /330 
possible  during  optimum  filter  P  ^  (or  ft  2)  system  design  to  start  from  equations 
(18.4)  and  (18.5)  or  directly  from  the  Figure  18.2  circuit  if  you  assume  given 
message  correlation  function  R  (t.,  t„)  and  require  that  minimum  mean-square  error 

X  1  z 

t(t)  be  obtained.  Here,  if  the  discriminator  still  is  not  considered  and  its 
parameter  k  still  not  determined,  it  is  possible  to  obtain  a  signal  decision  in 
a  general  form,  i.  e. ,  for  random  value  k. 


Since  the  majority  of  the  assumptions  made  previously  were  qualitative  [see 
relationship  (18.1),  for  example],  the  following  approach  is  possible  reliably 
to  determine  to  what  degree  a  system  structure  found  will  be  optimum  for  actual 
message,  signal,  and  noise  characteristics.  For  the  system  structure  found,  comput 
mean-square  error  er  through  analysis  of  the  passage  of  signal  and  noise  throuc^i 
the  system  and  compare  the  resultant  value  with  the  value  computed  from  formula 
(18.13).  If  both  values  ^tum  out  to  be  close  to  each  other,  then  this  will 
signify  that  the  system  structure  found  actually  is  sufficiently-close  to  optimum. 

It  should  be  noted  that,  during  optimum  linear  filter  02  (and*  of  P ±  as 
well)  system  design,  it  is  possible  in  the  Figure  18.2  circuit  to  assume  that 
x(t)  =  0  if  you  simultaneously  accept  that  the  mean  message  x(t)  value  at  system 
input  also  equals  zero.  Here,  the  result  obtained  will  be  valid  also  in  the  case 
when,  in  actuality,  x(t)  i  0.  This  is  explained  by  the  fact  that  the  mean  value 
of  measured  message  "fit)  in  the  Figure  18.2  circuit  equals  x(t)  and,  as  a  result 
of  this,  during  shaping  of  error  signal  C(t),  the  mean  values  of  processes  x(t) 
and  Tit)  mutually  compensate  for  each  other  and  they  do  not  reach  filter 
input. 

Consequently,  during  filter  P  2  sVJt®r,  design,  it  is  possible  to  begin  with 
the  Figure  18.3a  equivalent  circuit.  Here,  the  assumption  is  that  x(t)  =  0. 

It  is  possible  also  to  represent  the  Figure  18.3a  circuit  in  the  form  depicted  /331 
in  Figure  18.3b  since  the  transfer  of  disturbance  to  system  input  does  not 

change  the  effect  at  its  output.  But,  linear  filter  r  2  encompassed  by  feecback 
also  is  a  linear  system  with  some  impulse  response  * ) .  Therefore,  the  Figure 
18.3b  circuit  may  be  represented  in  the  form  shown  in  Figure  18.3c,  where  0, 

—  some  equivalent  linear  filter  with  impulse  response  q,(/,  t)  • 


ACn  Ith 

xftj  +/~\  ttti*x(thr(t[  A 


(D  ‘M. 


X(tl+Ain(t) 


Figure  18.3 

It  follows  from  Figure  18.3c  that  filter  <t> t  must  insure  reproduction,  with 
minimum  root-mean-square  error,  of  message  x(t)  arriving  at  its  input  in  a  mixture 
with  additive  white  noise  Aea(j).  This  problem  fully  coincides  with  that  presented 
in  §  2.2  for  optimum  linear  filtration. 

Therefore,  optimum  impulse  response  q»(*.  t)  must  be  determined  by  a  (2.21a)-type 
integral  equation,  i.  e.,  by  the  equation 

t 

j  Rx  (*.  s)  q,  (f,  s)  ds  +  $0,0*1,  (*»  T)  5=1  R*  (*»  *)• 

Comparing  this  with  equation  (18.3)  and  considering  that,  in  the  given  case 

c  _ tm  _  I 


we  obtain 


C  _  _  * 

T"  T* 


*!,(',  x )**kC{t,  t). 


i.  e.,  previously-introduced  additional  function  C  (t,  T)  is  (precise  to  constant 
factor  k)  the  impulse  response  of  equivalent  linear  filter  <&t,  accomplishing  optimum 
filtration  of  message  x(t)  in  the  Figure  18.3c  circuit. 
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It  follows  from  relationship  (18.4)  that  impulse  response  7|(t,T)  of  optimum 
linear  filter  0  ^  *s  linked  with  impulse  response  C(t,  T)  also  by  an  integral 
equation.  Thus,  two  integral  equations,  (18.4)  and  (18.5),  which  determine  the 
structure  of  optimum  linear  filter  r  ^  included  in  the  Figure  18.1  system,  have 
the  following  meaning:  a)  equation  (18.5)  makes  it  possible  to  find  impulse 
response  kC(t, T)  of  the  equivalent  linear  filter  connected  as  in  the  Figure  18.3c 
circuit  from  the  characteristics  of  message  x(t)  and  white  noise  Ae„(Q;  b) 
equation  (18.4)  makes  it  possible  from  impulse  response  kC(t,7*)  found  for  this 
filter  to  determine  the  corresponding  impulse  response  of  linear  filter  r  ^ 

(or  0  .),  i.  e.,  of  the  filter  encompassed  by  feecback.  Thus,  optimum  linear 
filter  (p  ^  structure  is  determined  by  two,  rather  than  one,  integral  equations 
because  this  filter  is  encompassed  by  feedback. 


Hence,  it  also  follows  that  the  theory  of  optimum  nonlinear  filtration  includes 
the  theory  of  optimum  linear  filtration  as  a  particular  case. 


It  follows  from  relationships  (18.6) — (18.9)  that,  during  optimum  discriminator 
system  design,  input  mixture  y(t)  is  examined  in  interval  At,  slight  compared 
to  T„op  x  ,  but  large  compered  to  *imp*  and  TXPp  m  .  This  signifies  that  the  optimum 
discriminator,  having  significant  sluggishness  (and,  consequently,  filtering  /332 
properties  as  well)  with  regard  to  noise  and  signal  fluctuations,  at  the  same 
time  is  essentially  inertia-free  with  regard  to  reproduced  message  x(t). 

Thus,  the  optimum  system  turns  out  to  comprise  two  considerably-different 
elements:  a  nonlinear  (in  the  general  case)  element  inertia-free  with  regard 
to  message  x(t)  (discriminator)  and  a  sluggish  [both  for  noise  and  for  message 
x(t)]  linear  element — filter  0.. 


This  is  due  to  the  nature  of  the  assumptions  made  during  system  design,  primarily 
those  concerning  the  slowness  of  the  message  x(t)  change  and  insignificance  of 
the  error  in  its  reproduction. 

Actually,  for  example,  let  us  design  a  radio  receiving  system  operating  in 
accordance  with  the  simplest  circuit,  i.  e.,  comprising  an  rf  amplifier  (including 
rf  amplifier,  frequency  converter,  and  if  amplifier),  detector,  and  If  amplifier. 

It  is  known  that,  given  powerful  noise  and  a  broad  message  spectrum,  best  message 
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reproduction  requires  primarily  narrowing  the  bandwidth  ahead  of  the  detector, 
even  if  this  causes  some  message  distortions. 


If  this  receiver  is  designed  for  high  message  fidelity  and,  consequently, 
will  operate  in  noise  that  is  not  too  strong,  while  the  message  spectrum  width 
is  small,  then  it  is  possible,  without  sacrificing  message  fidelity,  to  make  the 
bandwidth  ahead  of  the  detector  sufficient  for  undistorted  (when  there  is  no  noise) 
message  reproduction  and  to  narrow  the  bandwidth  only  beyond  the  detector. 

Analogous  to  this  and  in  the  optimum  system  examined  above,  it  turns  out 
to  be  possible,  given  such  conditions,  to  make  the  entire  system  essentially  inertia 
free  with  regard  to  reproduced  message  x(t),  with  the  exception  of  sluggish  linear 
input  networks — linear  filter  fiy 

The  circumstance  that  optimum  filter  at  discriminator  output  is  a  linear 
system  is  stipulated  during  system  design  by  the  assumption  accepted  concerning 
the  normal  message  x(t)  law  of  distribution.  Given  another  law  of  message  distri¬ 
bution,  optimum  smoothing  networks  at  discriminator  output  turn  out  to  be  nonlinear. 

However,  the  formulas  presented  above  for  optimum  system  design  are  valid 
also  for  a  law  of  message  distribution  differing  from  the  norm  if,  in  accordance 
with  problem  conditions  (with  respect  to  considerations  of  simplicity,  for  example), 
filter  i  at  discriminator  output  must  be  linear. 

We  now  will  explain  the  significance  of  the  feedback  with  respect  to  /333 
reproduced  message  x(t)  present  in  the  optimum  system  (Figure  18.1).  When  error 


Figure  18.4 


signal  €(t)  values  are  slight,  discriminator  rate  of  change  equals  constant  magnitude 
k,  i.  e.,  discriminator  response  curve  z  *  z(f)  is  linear.  However,  when  €  values 
are  large,  this  response  curve  turns  out  to  be  nonlinear  (Figure  18.4).  Therefore, 
difference  «  —  v  must  be  kept  sufficiently  small  in  order  for  the  discriminator 

to  operate  normally.  The  true  message  x(t)  value  will  be  contained  in  mixture 
y(t)  arriving  at  discriminator  input}  the  reproduced  value  of  this  message 
is  shaped  only  at  the  output  of  this  system.  Therefore,  it  is  necessary  to  introduce 
7«>  into  the  discriminator  via  the  feedback  network,  as  depicted  in  Figure  18.1. 

The  necessity  to  introduce  a  priori  known  mean  message  value  x(t)  at  system 
output  (Figure  18.1)  also  is  quite  understandable.  As  previously  noted,  thanks 
to  this,  determinate  component  x(t)  contained  in  reproduced  message  x(t)  turns 
out  to  be  compensated  for  at  the  input  of  filter  ^2»  and  of  alao»  and» 

consequently,  filter  00  longer  most  reproduce  it.  This  makes  it  possible, 

all  other  conditions  being  equal,  to  narrow  the  filter  Q  ^  bandwidth  and  to  decrease 
the  noise  action  accordingly. 

So,  the  aforementioned  relationships  make  it  possible  to  determine  the  structure 
of  the  optimum  message  x(t)  reproduction  system  and  to  find  the  potential  (minimum- 

possible)  value  of  message  x(t)  reproduction  error  V'V  (0  . 

All  these  relationships  were  obtained  for  a  case  when  signal-plus-noise  y(.) 
at  system  input  may  be  represented  in  the  form  of  expression  (17.55).  However, 
the  case  often  occurs  when 

» (0  (*;  e;  a, . aj  +  um  (t),  (18.14) 

i.  e.,  it  is  not  reproduced  parameter  x(t)  iti  >t  is  introduced  directly 
into  the  expression  for  the  signal,  but  rather  3igra»l  error 

•  (18.15) 

the  task,  as  usual,  boils  down  to  best  message  x(t)  reproduction,  i.  e.,  to  meeting 
condition 


IV. 
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Such  a  situation  occurs,  in  particular,  when  designing  automatic  target  tracking 
system  using  angular  coordinates. 


A  functional  schematic  of  such  a  system  is  depicted  in  Figure  18.5.  Axis  /334 
z  of  this  system's  equisignal  zone  at  each  moment  in  time  is  directed  at  the 
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* 
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Figure  18.5.  (a)  —  Receiver  (direction  finder); 

(b)  —  Antenna  control  unit;  (c)  —  Target. 

target  with  the  aid  of  motors  included  in  the  antenna  control  unit.  Error  signal 
z(t)  characterizing  deviation  €  of  axis  z  from  the  direction  to  the  target  reaches 

9 

the  input  of  this  unit  from  radio  receiving  device  output.  It  is  possible,  given 
slight  angular  deviations,  to  assume  that 


z{ O-AMQ  +  Ae.Wl, 


(18.17) 


where  k  -  radio  receiving  device  (radio  direction  finder)  gain  with  respect  to 
the  error  signal,  Aea  (/)  —  measurement  error  caused  by  noise  action  and  reduced 
to  angular  deviation  e . 


Evidently,  in  this  case,  the  target  angular  coordinate  and  equisignal  zone 
z&  angular  coordinate  (coordinates  in  a  certain  fixed  system)  play  the  role  of 
reproduced  message  x(t)  and  the  result  of  its  reproduction  f(,t)  supplied  by  the 
system,  respectively.  Designations  in  Figure  18.5  for  simplicity  are  given  relative 
to  operation  of  one  of  two  antenna  control  system  channels  (for  the  angle  of  sight 
antenna  control  channel,  for  example) ;  therefore,  instead  of  vector  magnitudes 

x,  'J't  and  £  »  only  their  scalar  components  located  in  one  plane  are  supplied, 
while  the  fixed  coordinate  system  is  supplied  by  direction  N. 
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Given  highly-accurate  automatic  tracking,  the  antenna  control  unit  may  be 
considered  linear  and  the  autotracking  system  structural  schematic  may  be  represented 


-v  €(tbx(thr(tf  ✓k 

1  «zn 

1 

Figure  18.6.  (a)  —  F  [Filter]. 


in  the  form  shown  in  Figure  18.6.  In  this  figure,  p  —  linear  filter  with  transfer 
constant  equalling  (precise  to  constant  factor  k)  the  antenna  control  unit  transfer 
function. 


This  structural  schematic  completely  coincides  with  the  servomechanism  struc¬ 
tural  schematic  depicted  in  Figure  18.3a.  Therefore,  the  circumstance  that,  in 
this  case,  the  automatic  target  tracking  system  structural  schematic  was  given 
beforehand,  i.  e. ,  prior  to  beginning  mathematical  system  design,  does  not  hinder 
obtaining  optimum  results  corresponding  to  the  case  of  the  above  optimum  servo¬ 
mechanism  system  design.*  The  difference  lies  only  in  the  fact  that,  in  the  case 
of  the  autotracking  system  under  examination  (Figure  18.5),  mixture  y(t)  of  /335 
signal  with  additive  noise  must  be  represented  in  the  (18.14)  form,  rather  than 
the  (17.55)  form,  since  the  signal  at  antenna  system  output  is  modulated  by  error 
signal  €  (t)  a  x(t)  —  Y(t),  rather  than  by  mesaage  x(t)  itself. 

It  is  clear  from  the  above  optimum  system  analysis  that,  here,  all  relationships 
characterizing  this  system's  structure  and  properties  remain  valid  if,  in  formulas 
(18.6),  (18.7),  and  (18.8),  signal  parameter  x  is  replaced  by  new  signal  parameter 
€ ,  i.  e.,  to  assume 


"Strictly  peaking,  the  difference  will  be  that,  in  the  optimum  system  (Figure 

18.1),  input  of  a  priori  known  expected  value  x(t)  is  envisaged,  while,  in  the 

Figure  18.5  ayatam,  it  is  not  envisaged.  Therefore,  if  x(t)  i  0,  then  input  of 

function  x(tj  at  antenna  control  unit  output  must  be  envisioned  to  eliminate  this 
difference  in  the  Figure  18.5  system. 
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(18.18) 


(18.19) 

(18.20) 


Here,  as  before,  P <  (y) 

At)  probability  density 
€  (t). 

Examples  of  the  use  of  the  resultant  general  relationships  to  determine  optimum 
system  structure  (optimum  discriminator  and  linear  filter  structures)  and 
computation  of  potential  message  fidelity  will  be  presented  in  subsequent  sections. 

18.2  Optimum  Discriminator  System  Design 

The  following  sequence  of  operations  for  optimum  discriminator  system  design 
flows  from  general  relationships  (18.6)— (18.8)  or  (18.18)— (18.20)  and  (18.12) 
presented  in  the  preceding  section. 

First,  we  compute  likelihood  function  Px(y)  and  its  logarithm  In  Px(y). 

Then,  the  expression  found  for  In  Px(y)  is  written  in  the  form  of  an  integral 
from  some  time  function  in  the  (t  —  At)  —  t  range.  Here,  the  integrand  also 
will  be  Q(y,  x,  t). 

Next,  differentiating  Q(y,  x,  t)  with  respect  to  x  and  replacing  x  for 
in  the  resultant  expression,  we  will  find  oscillation  z(t)  at  discriminator  output. 
The  resultant  expression  for  z(t)  demonstrates  which  mathematical  operations  must 
be  performed  on  received  oscillation  y(t)  in  the  discriminator,  i.  e.,  determines 
discriminator  structure. 

It  follows  from  expressions  (18.7)  and  (18.12)  that  the  spectral  density 
gm  of  noise  Aea  (0,  which  is  reduced  to  error  signal  6  (t),  is  determined  from 
the  formula 


—  likelihood  function,  i.  e. ,  realization  y(t)  (at  interval 
(multidimensional)  for  a  given  realization  of  signal  parameter 


f3Q(r. »,  t) 

8 1 


]•— « 


*(<)--[■ 

*„p§pj, 


i 

J 

I -A I 


S  Qbt,  z,t)  dt  --IrtP.te). 
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8m*  m  k 


q  (y,  X,  Q  j» 


(18.21) 


i.  e.,  computation  of  ga* requires  finding  the  second  derivative  with  respect  /336 
to  x  from  Q(y,  x,  t)  and  averaging  the  resultant  expression  with  respect  to  all 
possible  realizations  (superimposed  thin  line)  and  with  respect  to  time  (superimposed 
dotted  line).  If  the  process  is  ergodic,  then  averaging  only  with  respect  to 
time  in  formula  (18.21)  suffices. 

As  our  example,  we  will  examine  a  case  when  the  signal  is  determinate  (precisely 
known,  except  for  x),  while  additive  noise  «m(0  has  the  form  of  normal  white  noise. 
Here,  the  likelihood  function  is  determined  from  expression  (4.10),  which  in  this 
case  may  be  written  in  the  form 


J  hr (0 -■•(*.  *)!*<«  . 


(18.22) 


where  c  —  some  constant. 


Therefore 


.  t 

In /’■(f)— —  Inc-f  —  C  lr(0 -«*((,  *))*  dt. 
*  f-Af 


(18.23) 


Since  only  partial  derivatives  with  respect  to  x  will  be  included  in  the  expressions 
for  z(t)  and  gm»  ,  then,  as  is  easy  to  see,  expression  (18.23)  terms  not  depending 
on  x  do  not  affect  the  results  and,  consequently,  they  may  be  disregarded.  Here, 
comparison  of  expressions  (18.8)  and  (18.23)  provides 


<?<*.  *,  0 Or  (0-n  «.*>!•. 


(18.24) 


Substituting  this  relationship  into  (18.6),  we  obtain 


«(0-»(Q«tV.  7) +«»(<.  7). 


(18.25) 


WWW 
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..  .  2r*ta«,joi 

“''•«-5;l-jr-J„, 


(18.26) 


(18.27) 


The  discriminator  structural  schematic  depicted  in  Figure  18.7  corresponds  to 
these  expressions,  i.  e.  the  discriminator  will  comprise  a  synchronous  detector 
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Figure  18.7.  (a)  —  Discriminator; 

(b)  —  G  [Generator]. 

(multiplier),  adder,  and  generators  A  and  f  ,  of  time  functions  u^(t,  ~J  )  and 
u2(t,/),  the  only  unknown  parameter  of  which  is  oscillation  ^ (t)  supplied  from 
servomechanism  output  (see  Figure  18.1). 

For  example,  let  signal  u  (t,  x)  be  modulated  with  respect  to  amplitude  by 

w 

the  message,  i.  e., 


«•(<.  *)*(/,  (i + iur  «>|  0  CO. 


(18.28) 


where  UQB(t)  —  signal  carrier  oscillation.  Then,  from  formulas  (18.26)  and  (18.27) 

we  obtain 


l/i.»*.  »*.  •-'.  r**  « 


V  V  ■  ^ "f* 


and 


! 


“»  (*.  v)  -  -  ^  t/.*  e»  (/)  [  i + n  mi. 


(18.30) 


Since  signal  carrier  oscillation  UQB(t)  and  parameters  £4  and  Nq  are  assumed  known, 
then  these  expressions  completely  determine  generator  rx  and  r,  structure. 

The  following  expression  is  obtained  (considering  the  insignificance  /337 
of  magnitude  e  =  x  —  y)  from  formuia8  (18.21)  and  (18.24)  for  the  spectral  density 
of  the  noise  at  discriminator  output: 


(18.31) 


For  amplitude  modulation,  when  signal  uc(t,  x)  is  determined  from  expression  (18.28), 
formula  (18.31)  provides 


where 


(18.32) 


P#cp«tVBr(0 


(18.33) 


—  specific  carrier  oscillation  mean  power  (here  and  in  future,  power  developed 
in  a  unitary  resistance  is  called  specific). 


We  now  will  examine  a  case  of  a  fluctuating  signal  of  the  type 

«•(<;  *;  «1»  ....  «m)  -  a  (/,  x)  E  (0  cos  l»,  t+y[t,x)  +  v  (I)).  (18.34) 

Here,  E(t)  and  <^(t)  —  signal  amplitude  and  phase  fluctuations,  respectively, 
while  the  assumption  is  that  E(t)  cos  <^(t)  and  E(t)  sin  <|)(t)  have  normal  laws 
of  distribution  with  zero  mean  values  and  known  correlation  function  PW. 

Here,  E(t)  is  normalized  so  that 

EtyJ  -  2P*,  (18.35) 
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—  specific  mean  signal  power;  u(t,  x)  and  y  (t,  x)  —  usable  amplitude  and  phase 
signal  Modulation  by  message  x.  [If  a  signal  ia  modulated  only  with  respect  to 
amplitude,  then  t,  x)  §  0;  if  it  is  modulated  only  with  respect  to  phase,  then 
u(t,  x)  s  1}.  Function  u(t,  x)  is  normalized  so  that 


tin  the  case  of  a  pulse 
s  0)  spacing  f* .  The  aasum 
there  are  no  fluctuations  <p 


H*Tt73  — 1. 

signal,  u(t,  x)  — 
t  ion  hire  is  that 


(18.36) 


•  function  with  (where  x 
pulses  are  coherent  when 


We  will  assume  that  signal  fluctuations  are  relatively  slow,  i.  e.,  function 
p  (t)  changes  slowly  compared  with  u(t,x)  ami  f  it,  x).  However,  even  given 
such  limitations  placed  on  the  nature  of  the  signel  fluctuations,  resultant  /338 
calculations  are  more  unwieldly  than  is  the  case  for  the  aforementioned  regular 
signal  (see  (127*),  for  example).  Therefore,  se  will  present  here  only  the  most 
important  result  (obtained  in  (1273)  «— *  the  formula  for  determination  of  the  spectral 
density  of  the  noise  at  discriminator  Oftpst?  given  additive  noise  *«  (Oin  the 
fans  of  normal  whits  noise: 


where 


yg-±  f  J£M 

£.1  + 


(18.37) 


(18.38) 


$t(m)  —  signal  fluctuation  power  spectrum  (bilateral),  linked  with  fluctuation 
correlation  function  P  (t)  by  a  Fourier  transform: 


P(T)  ~t~  j  Se  ((•))  coS  out  <f<o; 


(18.39) 


e» 

P(°)=--  s  Se (on) da  =■  Pep. 


(18.40) 


If  there  is  no  phase  modulation  \j/(. t,  x),  then,  the  result  from  (18.37)  is 


du  (t,  *)1* 

*•[  a*  J  . 


(18.41) 


Formula  (18.37)  significance  is  that  it  makes  it  possible  to  determine,  for 
a  fluctuating  signal,  the  magnitude  of  the  spectral  density  of  noise Ae„(/)at  optimum 
discriminator  output,  while  this  spectral  density,  as  follows  from  Figure  18.3, 
determines  potential  message  fidelity. 


If  signal  fluctuation  correlation  function  P  W  has  an  exponential  nature, 
the  spectrum  S0  (®)has  the  form 


Sfl  l+(e/2A/o)*' 


(18.42) 


where 


5  /<*) .  5si!l , 

5e(0) 
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Substituting  expression  (18.42)  into  (18.36)  provides 


(18.43) 


—  equivalent  width  of  fluctuation  normalized  spectrum  S»(©) ,  i.  e. ,  the  /339 
spectrum  determined  from  relationship 


(18.44) 


(l  +  KITS')  vT+sr 


where 


h=  -^L-  (18.46) 

*.A/„ 

is  the  ratio  of  signal  mean  power  to  the  mean  additive  noise  power  in  the  band 
equalling  equivalent  width  of  the  signal  fluctuation  spectrum. 


Given  a  high  signal-to-noise  ratio  (h  ^1),  from  (18.45)  we  obtain 


V,«2  (18.47) 

Here,  if  there  is  no  phase  modulation  \jj{ t,  x),  for  gm  „  formula  (18.41)  is  valid, 
from  which,  considering  (18.47),  we  obtain 


am  » - —  • 

(18.48) 

But,  when  there  is  no  phase  modulation  l^(t,  x),  from  (18.34)  we  obtain  [considering 
E(t)  slowness  compared  to  u(t,  x)] 


Therefore,  formula  (18.48)  may  be  written  also  in  the  following  form: 


(18.49) 


This  expression  completely  coincides  with  relationship  (18.31)  for  a  determinate 
signal . 


Thus,  for  the  aforementioned  conditions  [i.  e.,  where  h  1,  ^(t,  x)  ==0 
and  signal  fluctuation  slowness  compered  with  u(t,  x)],  the  spectral  density 
gmt  value  and,  consequently,  potential  message  x(t)  fidelity  as  well,  is  obtained 
for  a  fluctuating  signal,  just  as  for  a  determinate  signal  with  the  same  mean 
power.  However,  in  the  general  case,  signal  fluctuations  will  lead,  naturally,  /340 
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to  a  deterioration  in  potential  accuracy,  i.  e.,  to  an  increase  in  spectral  density 
8a  9 (given  the  same  mean  signal  power) . 


18.3  Optimum  Linear  Filter  Y^(of  Optimum  Smoothing  Networks)  System  Design  and 
Determination  of  Potential  Message  Fidelity 

As  noted  in  §  18.1,  optimum  filter^  structure  (Figure  18.1)  completely 
is  determined  from  relationships  (18.4)**“(18.6),  while  potential  fidelity  is  determined 
from  formula  (18.13).  A  detailed  study  of  these  relationships  for  various  message 
x(t)  characteristics  is  presented  in  [127].  We  will  limit  ourselves  here  for 
brevity  to  examination  only  of  several  most-characteristic  cases. 

We  will  begin  with  examination  of  a  case  when  message  x(t)  may  be  considered 
a  stationary  random  process  and  may  be  restricted  to  minimization  of  the  root-mean- 
square  error  only  for  the  steady-state  mode,  i.  e,  to  assume  that  t  =  —  oo . 

Here,  optimum  linear  filter  0^  has  constant  parameters  and,  consequently,  completely 
is  characterized  by  its  impulse  response  •  *1  V  — T)  or  transfer  function  (fa)  . 

In  this  case,  it  is  more  convenient  to  find  these  characteristics  by  reducing 
the  problem  to  optimum  linear  filtration,  examined  in  §  2.2,  rather  than  direct 
solution  of  integral  equations  (18.4)  and  (18.5). 


It  is  evident  from  the  Figure  18.3b  and  18.3c  circuits  that  equivalent  filters 
and  4**  have  the  following  transfer  functions,  respectively 


(18.50) 


and 


*.</») 


*,(/«) 

l+/?i  (/•») 


(18.51) 


The  Figure  18.3c  circuit  completely  coincides  with  the  optimum  linear  filtration 
circuit  Wiener  examined  and  which  was  presented  in  §  2.2.  Therefore,  formula 
(2.14)  is  valid  for  optimum  transfer  function  K»(/®)  system  design.  Since  simple 
message  reproduction  [h(t)  =  x(t)]  is  required  in  this  case,  while  message  x(t) 
and  noise  (/)  may  be  considered  statistically  independent,  then 


S9h (•) « S„ (»);  S, (to) - Sx(a)  +  Sm (to), 


(18.52) 


and  formula  (2.14)  provided 


[  ''"‘"I  (18-53) 

where 

¥(/«»)♦(— AO  —  It  (AO  I*  (18.54) 

Power  spectrum  S-,  (<a)  may,  without  a  significant  loss  of  generality,  /341 
be  represented  in  the  form  of  the  rational  function 

S,(»)-±2sd£)l-,  (18.55) 

where  Qm  (/«)  and  P«(/«)  —  polynomials  of  degrees  m  and  n  (where  n  >  m)  of  /« 
with  weight  factors.  Here,  functions  t  (/»)  and  $  (—/m) included  in  expression  (18.53) 
may  be  found  by  the  method  of  special  expansion  (factorization)  of  function 
S„  (m)  into  factors  with  zeros  and  pluses  in  the  upper  half- plane  [at  t  (/®)] 
and  in  the  lower  half-plane  [att(— /«■)]  of  complex  variable  U  (see  [127],  for 
instance) . 

For  practical  use,  it  usually  is  more  convenient  to  represent  formula  (18.53) 
in  the  following  form  [127]: 


(18.56) 


where  the  operation  []+  denotes  the  taking  of  that  term  of  the  expression  in  brackets 
which  has  pluses  in  the  upper  half-plane  of  complex  variable  U . 


It  follows  from  formulas  (18.50)  and  (18.51)  that 

Kt  W-7^,  (18.57) 

Therefore,  having  computed  magnitude  /C,(A0  from  formula  (18.56),  it  then  is  possible 
to  find  needed  filter  0^  optimum  transfer  function  Kt  (/»)  from  formula  (18.57). 


The  methodology  Wiener  demonstrated  for  finding  optimum  transfer  function 
insures  physical  realizability  of  this  transfer  function.  Filter  KtQ to) 
turns  out  also  to  be  physically  realizable  when  frequency  spectrum  S„  (to)  is 
represented  in  a  (18.55)-type  of  rational  function  and,  consequently,  the  methodology 
presented  above  for  its  system  design  is  valid. 

As  our  example,  we  will  examine  a  case  when  the  message  x(t)  spectrum  may 
be  represented  in  the  form 


l  +  W 


(18.58) 


where  «•*  —  variance  of  magnitude  x,  while  1/T  —  width  of  the  spectrum  at  the 
half-power  level. 

Since,  in  the  case  being  examined,  Ae„  (t)  —  white  noise  with  unilateral 
spectral  density  gmt  -  2k,  then  its  bilateral  spectral  density  equals 


I/*. 


(18.59) 


and,  considering  (18.52)  and  (18.59),  we  have 


a  gB»n-fp+(«r)»i 

2(!+(»n*i  * 


(18.60) 


where 


P--22S1I.  (18.61) 

tut 

Next,  in  accordance  with  (18.54),  we  will  expand  function  $»(»)  into  factors 
*(/«)  and  M-M  containing  zeros  and  pluses,  respectively,  in  the  upper  and  in 
the  lower  half-planes  of  frequency  U ,  considered  a  complex  variable.  In  the 
given  case,  this  expansion  is  evident: 


»  .  .  8m o  W i +p+/«r)  (V  i  +p— /»r) 

Su  (C0)  “  — ~  _  - 


d  +  /»r)(i-/«r) 


(18.62) 


In  accordance  with  (18.56),  we  will  find 


r  5,(«)  i  g-.pr  - 1  . 

L  ♦  (-/«)  J+“  2  III  +(«r)*l(/T+p-i«f)  J+ 


(■ _ Pgm  0 

'l  2(i+/®r)(/rrp 


“+p —  /«»r)  ]+ 


i  +  /®r  ^VT 


- 1  . 

i+P-  /«r  J+ 


J+  2(/r+P+i) 

_ pgm  i 

2(vr+p+0(i+/®r)  ■ 


(18.64) 


Substituting  relationships  (18.63)  and  (18.64)  into  (18.56),  we  obtain 

“(VTTp+i)(VrTp+/er)- 


(18.65) 


Here,  formula  (18.57)  provides 


-7TF&f+l)  „+U- 


(18.66) 


Consequently,  in  the  case  under  examination,  optimum  filter  must  comprise 
an  amplifier  with  gain 


v  flg« » 

K#“2(KT+P+I) 


i+«£+i 

gm  •  / 


(18.67) 


and  an  inertia  component  with  constant  time  T. 


We  now  will  find  potential  message  measurement  accuracy  determined  from  rela¬ 
tionship  (18.13): 


0,iw7io-C(i,i). 


(18.13) 


As  noted  in  §  18.1,  kC(t,  O  is  the  filter  impulse  response  (Figure  18.3c) 
In  the  case  being  examined,  this  filter  has  constant  parameters  and,  consequently, 
C(t,T)  a  C(t  —  T).  Therefore,  a  C(0). 


Considering  that  a  Fourier  transform  links  the  filter  impulse  response 
kC(t  —  T)  with  its  transfer  function  /C»  (/«)  ,  the  following  relationship  may 
be  obtained  [127]: 


*C(0)h>lim /»*•(/»). 


(18.6 


Therefore, 


0  <*4- 1lm/«Ka(/e). 

A  »-m 

and,  considering  (18.61)  and  (18.65),  we  have 

P  20§* 

*riK>+p+u"  yi+p+i  * 


(18.6 


where 


40,«r 


(18.7 


For  comparison,  we  now  will  examine  solution  of  the  same  problem  using  a  sim¬ 
plified  Bode- Shannon  method,  in  which  no  condition  of  physical  accomplishment 
is  placed  on  the  filter  system  being  designed.  Here,  in  accordance  with  formulas 
(2.19)  and  (2.20),  we  have 


*.<M’ 


+  S.(e) 


(18.7 


l  T*  *  P  dm 

J  7+55’ 


(18.7 


Substitution  of  expressions  (18.58)  and  (18.60)  into  these  formulas  provides 


*,</«)- 


l+P+l^F)* 


(18.7 


It  follows  from  (18.57),  (18.59),  and  (18.73)  that 


Kt  (/»)  — 


Mat 

2 


1 

i+W 


(18.75) 


As  is  evident,  resultant  transfer  function  Kt  (/»)  is  not  physically  realizable, 
but  the  potential  accuracy  value  found  here  (18.70)  slightly  differs  from  corres¬ 
ponding  value  (18.70)  obtained  for  a  physically-realizable  filter.  This  example 
confirms  the  validity  of  the  assertion  made  in  §  2.2  that  the  simplified  Bode-Shannon 
method  is  applicable  for  approximate  estimation  of  potential  message  fidelity. 


The  aforementioned  analysis  was  performed  for  a  case  when  reproduced  message 
x(t)  may  be  considered  a  stationary  random  process.  However,  in  a  number  of  cases, 
message  x(t)  reproduction  itself,  but  only  its  first  or  higher  derivative  with 
respect  to  time,  may  be  considered  a  stationary  process.  For  example,  if  aircraft 
acceleration  changes  randomly  and  stationarily,  then  its  velocity  (an  integral 
of  acceleration)  is  a  random  process  with  a  stationary  first  derivative,  while 
its  coordinate  is  a  random  process  with  a  stationary  second  derivative.  Processes 
of  this  type  fall  in  the  class  of  so-called  processes  with  stationary  increments 
(stationary  derivatives). 


The  methodology  for  design  of  linear  filter  systems  for  such  processes  /344 
is  presented  in  [127].  We  will  limit  ourselves  to  examination  only  of  the  simplest 
case  when  message  x(t)  may  be  approximated  by  the  integral  of  stationary  white 
noise  (such  a  process  also  often  is  called  a  Wiener  process). 

The  correlation  function  of  such  a  process  equals 

Bt  where  t  <  t; 

|  (18.76) 

Bt  where  1  > 

The  resultant  solution  of  equations  (18.4)  and  (18.5)  for  this  case  in  [127]  provides 
the  following  results: 


Substituting  these  relationships  into  expressions  (18.3)  and  (16.13),  we 
obtain 


(18.79) 


and 


(18.80) 


where  k  -  2/*^,  . 


It  follows  from  (18.78)  and  (18.79)  that,  in  the  case  being  examined,  the 
linear  filter  must  comprise  discriminator  z(t)  output  voltage  amplifier,  with 
amplification  changing  over  time  in  accordance  with  the  law  of  the  hyperbolic 
tangent,  i.  e.,  with  a  rise  from  zero  to  VSIi) ,  and  a  subsequent  integrator. 

Here,  as  follows  from  (18.80),  the  message  reproduction  error  variance  also  rises 
in  accordance  with  the  law  of  the  hyperbolic  tangent,  striving  towards  steady-state 
value  VB77  . 

As  our  final  example,  we  will  examine  a  case  when  message  x(t)  may  be  represented 
in  the  form  of  a  linear  combination  of  known  functions  with  random  factors,  i.  e. , 
in  the  form 


*(0-777)+ J|A*M0. 


(18.81) 


For  the  simplest  case,  when 


x(t)-x  <!)+/«/ <0  (18.82) 

and  parameter  A  has  a  normal  law  of  distribution  with  a  zero  mean  value  and  variance 


the  following  results  are  obtained  in  [127]: 


and 


IL- 


***ni)tto) 

1  +  Ae»*  }/•(«)«*• 


Here,  in  accordance  with  formulas  (18.3)  and  (18.13),  we  have 


(18.83) 


(18. B4) 
/345 


(18.85) 


t(0' 


i 

■ 


&1SQ112JJZL  dr+Tjlf. 

l  +  (>)da 


(18.86) 


(18.87) 


The  optimum  filter  'P  ^  structural  schematic  depicted  in  Figure  18.8,  it 
is  easy  to  see,  corresponds  to  expression  (18.86).  This  circuit  will  comprise 
two  multipliers,  adder,  integrator,  and  three  generators  of  known  time  functions 
UjU),  u2(t),  and  Uj(t). 


It  follows  from  formula  (18.87)  that  measurement  error  variance  and  the  error 
itself  will  become  equal  to  zero  in  the  steady-state  mode  (where  t  11  »  —). 

This  is  because,  in  the  case  under  consideration,  the  only  unknown  is  message 
x(t)  constant  parameter  A  and,  when  t  1  it  may  be  measured  precisely. 


Figure  18.8.  (a)  —  F.  [Filter] 

(b)  —  Gj— Gj  [Generators]. 

The  examples  presented  above  for  design  of  a  linear  filter  system  (for 
smoothing  networks)  and  for  determination  of  potential  message  fidelity  clearly 
confirm  the  following  general  maxims: 

1.  The  structure  of  an  optimum  linear  filter  and  its  parameters  very  signi¬ 
ficantly  will  depend  on  the  nature  of  reproduced  message  x(t)  and  on  whether  high 
message  fidelity  is  required  only  in  the  steady-state  mode  (where  tQ  =  —  00  or 

t  -*«>)  or  is  required  in  the  transient  mode  as  well. 

2.  Optimum  linear  filter  parameters  will  depend  on  the  magnitude  of  the 
spectral  density  of  the  noise  at  discriminator  output,  which,  in  turn,  /346 

will  depend  on  the  signal  power  to  noise  power  ratio.  Consequently,  filter  parameters 
must  change  when  this  ratio  changes.  No  such  change  is  envisioned  in  the  optimum 
system  examined  above  (Figure  18.1)  since,  during  its  system  design,  all  a  priori 
signal  and  noise  characteristics  were  assumed  known.  Therefore,  in  an  actual 
system  operating  under  conditions  whereby  many  of  these  a  priori  characteristics 
are  unknown,  the  accuracy  in  the  general  case  will  be  lower  than  its  potential 
accuracy  found  above,  even  if  you  envisage  in  this  system  devices  to  measure  noise 
and  signal  parameters  (because  such  measurements  may  not  be  precise  due  to  the 
noise  action). 

3.  Potential  message  fidelity,  characterized  by  the  magnitude  of  error  variance 
a,*  ,  monotonically  will  depend  on  the  spectral  density  im.%  of  the  noise  at 


discriminator  output.  The  variance  will  strive  monotonically  towards  zero  when 
spectral  density  gm  «strives  towards  zero. 


Such  a  result  is  understandable  since,  in  the  optimum  system  structural  schematic 
(Figure  18.2),  noise^e,,  (/)with  spectral  density  gmt  is  the  single  primary  source 
of  message  reproduction  errors;  when  this  noise  is  absent  (  gat  =0),  message 
x(t)  reproduction  may  be  as  precise  as  desired,  given  a  sufficiently-broad  filter 
bandwidth. 

Hence,  it  follows  that  spectral  density  gm  « ,  characterizing  the  potential 
accuracy  of  discriminator  action,  is  a  vital  parameter  determining  the  potential 
accuracy  of  system  operation  as  a  whole. 

18.4  Potential  Accuracy  Given  Complex  Message  Reproduction  and  Indirect 
Modulation  Types 

The  results  presented  in  §  18.3  applied  to  direct  signal  modulation  types 
and  to  simple  message  reproduction.  We  now  will  explain  the  special  features 
which  arise  for  indirect  modulation  types  and  during  complex  message  reproduction. 
Here,  we  will  restrict  ourselves  to  examination  of  the  problem  of  potential  deter¬ 
minate  signal  fidelity. 

It  was  demonstrated  in  §  18.2  that  the  presence  of  signal  fluctuations  during 
the  same  signal  mean  power  may  cause  deterioration  in  message  fidelity  or  keep 
it  essentially  unchanged.  Therefore,  the  potential  accuracy  value  found  for  a 
determinate  signal  may  be  considered  threshold  potential  accuracy,  which  may  not 
be  exceeded  regardless  of  actual  signal  shape. 

During  indirect  modulation,  oscillation  y(t)  at  measurement  system  input 
has  the  following  form; 


y  (#)  “  «e  (ft  *t)  +  K«(0. 


(18.88) 


where 


(18.89) 


Here,  (p)  —  linear  differential  operator}  p  =  d/dt  —  elementary  differen-  /347 
tiation  operator}  x(t)  —  measured  parameter}  x^(t)  —  direct  parameter,  i.  e., 
parameter  included  in  function  u  (t,  x.)  directly,  rather  than  under  the  sign 

C  1 

of  the  operator. 


Measurement  of  signal  frequency  Doppler  shift  Fa  (/)  i3  an  example  of  indirect 
modulation.  In  this  case 


y  (0  =  (/,  cos  [a,  t + 2n  $  FA  (/)  dt + <p,[ + um  (t). 


(18.90) 


Here,  it  is  evident  that 


x[t)  =F*(t). 

XtW  -SFA(/)d/-lx(/). 


(18.91) 


The  requirement  in  simple  measurement  is  to  measure  parameter  x(t)  with  minimum 
root-mean-square  error,  i.  e.,  to  insure  that  this  condition  is  met 


W)  -  U(0~Y(01*  -  mi". 


(18.92) 


where  f  (t)  —  measurement  result. 


The  requirement  in  complex  measurement  is  to  measure  with  minimum  root-mean- 
square  error,  not  parameter  x(t)  itself,  but  some  function  x,(/),linked  with  x(t) 
by  an  operator  relationship.  For  simplicity,  we  will  consider  the  operator  linear, 
i.  e.,  we  will  assume 


(18.93) 


So,  during  complex  measurement,  the  requirement  is  to  meet  this  condition 


•7(0  "  Y(0l*  =*  min, 

where  T(t)  —  measurement  result. 


(18.94) 


Consequently,  given  indirect  modulation  types  and  complex  measurement,  there 
are  three  types  of  parameters — x(t),  x^(t),  and  x  (/).  (0.  In  future,  we  will  call 

x(t)  the  primary  parameter,  x^(t)  the  direct  parameter,  and  x9  (/)  the  equivalent 
parameter. 

It  follows  from  relationships  (18.89)  and  (18.93)  that 

*,(/)“  25  (p)  jr,  (0.  (18.95) 


where 

25  (p)  -  257*  (p)  25,  (p),  ( 18 #  96) 

25r'<p)~l/$i(p)  (18.97) 

—  linear  operator  inverse  with  respect  to  25,  (p). 

It  follows  from  expressions  (18.88),  (18.94),  and  (18.95)  that,  during  indirect 
modulation  and  complex  measurement,  the  requirement  is  to  measure  with  minimum 
root-mean-sqiare  error,  not  direct  parameter  x^(t)  included  directly  in  the  signal 
function,  but  some  equivalent  parameter  x,  (0.  linked  with  x^(t)  by  linear  operator 
»(p). 

Expression  (18.95)  corresponds  to  a  general  case  when  both  indirect  /348 
modulation  and  complex  measurement  occur  simultaneously.  It  encompasses  the  following 
particular  cases: 

1.  Indirect  modulation,  but  simple  measurement.  Here,  evidently, 

25,(p)-l  and  25(p)  «257' (p).  (18.98) 

2.  Direct  modulation,  but  complex  measurement.  Here 

(P)  =*  1  and  25(p)=25,(p).  (18.99) 


ill 


3.  Direct  modulation  and  simple  measurement.  In  this  case 

S'*  (p)  -**,  (P)  =  %(p)~  1.  (18.100) 

It  follows  from  what  has  been  stated  that  the  general  case  (indirect  modulation 
and  complex  measurement)  differs  from  the  particular  case  (direct  modulation  and 
simple  measurement)  examined  in  preceding  sections  only  in  that  equivalent  parameter 
jt,  (/),  linked  with  direct  parameter  x^(t)  by  linear  operator  ®(p),  rather  than 
the  direct  parameter,  will  be  subject  to  measurement  with  minimum  root-mean-square 
error. 

We  will  clarify  which  changes  must  be  introduced  here  into  the  equivalent 
system  circuit  that  determines  potential  measurement  accuracy. 

This  circuit  may  be  represented  in  the  farm  depicted  in  Figure  18.9  for  direct 
modulation  and  simple  measurement  examined  in  preceding  sections.  The  task  of 


Figure  18.9.  (a)  —  Equivalent  receiver; 

(b)  —  Equivalent  filter. 


equivalent  optimum  receiver  IJ,  is  reproduction  of  direct  parameter  x(t)  with  the 
guarantee  of  minimum  noise  Aea  (f)  spectral  density  8m  t.  Here,  receiver  /7,  structure, 
noise  Aea  (/)  character,  and  magnitude  gm  ,  will  not  depend  on  reproduced  parameter 
x(t)  type;  only  the  structure  of  equivalent  optimum  linear  filter  and  the 
magiitude  of  the  root-mean-square  error  will  depend  on  this  parameter  (these  results 
are  valid  if  the  magnitude  of  the  root-mean-square  error  is  sufficiently  small). 

Hence,  it  follows  that,  in  the  general  case,  when  the  requirement  is  measurement 
of  equivalent  parameter  *•  (0  ,  rather  than  direct  parameter  x^(t)  itself,  with 
minimum  root-mean-sqiare  error,  potential  measurement  accuracy  may  be  determined 
with  the  aid  of  the  equivalent  circuit  depicted  in  Figure  18.10.  In  this  /349 
circuit,  receiver  /7,  and  noise  Ae„  (flare  identical  to  those  in  Figure  18.9. 


y(*hue(t,xt)+um(t) 


Xi(tl+A6n(t) 


r(ti 


Figure  18.10.  (a)  —  Equivalent  receiver; 

(b)  —  Equivalent  filter. 


Actually,  let  primary  parameter  x(t)  be  identical  in  both  circuits.  Then, 
in  the  Figure  18.9  circuit,  receiver  /7,  extracts  parameter  x(t),  while  in  the 
Figure  18.10  circuit,  receiver  /7,  extracts  parameter  x^(t)  linked  with  x(t)  by 
relationship  (18.89)  and,  consequently,  having  a  different  character.  But,  as 
noted  above,  receiver  /7,  structure  and  noise  Ae„  (/)  at  its  output  will  not  depend 
on  the  nature  of  the  measured  parameter  (if  this  parameter's  measurement  error 
is  sufficiently  small).  Therefore,  it  is  possible  to  assert  that  receiver  //, 
and  noise  Ae„(/)  in  Figure  18.10  may  have  exactly  the  same  form  as  in  Figure  18.9 
(if,  in  accordance  with  problem  conditions,  it  is  possible  to  assume  that  the 
reproduction  error  is  sufficiently  small). 

In  this  case,  filter  0«  structure  must  differ  since,  in  the  Figure  18.9  circuit 
when  noise  Ae„(Q  is  present,  the  requirement  is  to  insure  minimum  root -mean- square 
deviation  of  output  oscillation  ~jf  (t)  from  input  oscillation  x(t).  In  the  Figure 
18.10  circuit,  the  requirement  is,  given  the  same  noise  AeB(/),  to  insure  minimum 
root-mean- squa re  deviation  of  output  oscillation  7(t),  not  from  input  oscillation 
x^(t),  but  from  oscillation  *»(0  linked  with  x^(t)  by  relationship  (18.95). 

So,  in  the  general  case  under  discussion,  it  is  possible  during  determination 
of  potential  measurement  accuracy  to  assume  that  noise  Ae„  (t)  at  linear  network 
input  has  the  form  of  normal  white  noise  with  spectral  density  fin  •  determined 
from  formula  (18.31),  in  which  x  is  replaced  by  x^,  i.  e., 

a  Nt 

Sm  •  —  — - • 

(18.101) 

Here,  optimal  linear  filter  0/  structure  will  be  found  from  the  optimum  message 
x^(t)  complex  reproduction  condition,  i.  e.,  from  the  condition 


-lv(0-2)(p)*i(0r-  min. 


(18.102) 


Having  determined  filter  «£,'  structure,  it  then  is  possible  with  the  aid  of 
the  Figure  18.10  circuit  to  compute  the  magnitude  of  the  measurement  mean-square 
error,  e,1  (/)  . 

If  you  consider  that,  in  accordance  with  (18.95) 

*.(0-®(p)x,(0,  (18.103) 

then  it  is  possible  to  convert  the  Figure  18.10  equivalent  circuit  to  the  form 
depicted  in  Figure  18.11.  In  this  circuit 

Aen'  (0  =  ®  (p)  A«n  (0.  (18.104) 


Figure  18.11.  (a)  —  Equivalent  receiver; 

(b)  —  Equivalent  filter; 


while  linear  filter  4/  must  insure  that  this  condition  is  met 

w* 

VW  -  fY(0-*.(W  -  min,  (18.105) 

i.  e.,  provide  optimum  simple  reproduction  of  message  x,  (l)  arriving  at  its  input, 
given  additive  independent  noise  As£  (Q  having  the  form  of  normal  non-white  noise 
with  power  spectrum 

*;#(/) -|®(/2nO|‘*B0.  (18.106) 

In  some  cases,  the  Figure  18.10  circuit  may  tum  out  to  be  more  convenient 
for  determination  of  potential  accuracy,  while  the  Figure  18.11  circuit  is  more 
convenient  in  others. 


For  example,  let  the  requirement  be  to  find  the  potential  accuracy  of  simple 


measurement  of  frequency  Doppler  shift  F&(t)  .  In  this  case,  (p)  =  1,  while 

fc^(p)  s  1/p •  Therefore,  from  (18.96)  we  have  ^(p)  =  p.  Then,  the  result  from 
(18.101)  and  (18.106)  is 


jjjfsjf;  *d  j*  (18.107) 

where  x(t)  =  fnW  . 

For  illustration,  we  now  will  examine  computation  of  potential  measurement 
accuracy  in  a  case  when  primary  message  x(t)  is  a  normal  stationary  process  with 
known  mean  value  x(t)  and  power  spectrum  A*  (/)  ,  and  no  condition  of  physical 
realizability  is  levied  on  the  linear  networks. 

In  this  case,  filters  0%  and  W  have  constant  parameters  and  are  completely 
characterized  by  their  transfer  functions  K%  (/•)  and  *»*(/•)  ,  respectively. 

Here,  we  will  begin  from  the  Figure  18.11  circuit.  Then,  the  problem  boils 
down  to  optimum  simple  reproduction  of  message  n(Q  on  a  background  of  normal 
noise  A«n'  (0  ,  where 


Ae.'  (/)  -  0  (p)  Ae„  (Q  -  £>f‘  (p)  g,  (p)  AeB  (t) . )  (18.108) 

We  will  assume  that  operators  <**i(p)  and  (p)  are  such  that  processes 
x«(4and  A«B*  (0  may  be  considered  stationary.  Then,  potential  accuracy  in  the  steady- 
state  mode  (where  tQ  =  *~co)  is  determined  from  relationship  (2.20)  and 


•TO 


■J 


•  1  + 


&,(/) 


(18.109) 


where  g*,(f)  and  i'mti) —  unilateral  power  spectra  of  processes  *•(<)  and  Ab'„(q  , 
respectively. 


It  follows  from  (18.108)  that 


fa,  </)-!«.  I*  *.(/). 

gm  o  (/)  *  I  *>  (P*/)  I*  fa  # - 1  (M)  1 1 1  I*  fa *■ 


Therefore 


(18.110) 
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|0(W)l»faa<y 

fat 


(18.111) 


As  our  example,  we  will  examine  measurement  of  signal  frequency  Doppler  shift 
ffl(Q.  Here,  as  noted  above, 


*>t 4> (p)-p. 


(18.112) 


and  expressions  (18.108)  take  the  form 


x,(l)~x(0»  (18.113) 

4*»*(0-p4«.(0.  (18.114) 

Since  we  assumed  that  x(t)  is  a  normal  stationary  process,  then  (0  is  the 
same  type  process.  Noise  4a.  (I)  is  normal  stationary  white  noise.  Strictly  speaking, 
there  is  no  derivative  of  such  a  process.  Therefore,  in  order  to  eliminate  this 
difficulty  during  the  operation  with  noise  4a.'  (4  ,  it  should  be  assumed  that  white 
noise  4a. (0  has  a  spectrum  bound  by  some  high  frequency  /.  .  Since  selected  frequency 
/a  may  be  as  high  as  desired  (but  finite),  then  this  will  not  impact  upon  computa¬ 
tional  results  if  spectrum  g^f)  in  the  region  of  hicfi  frequencies  decreases  rapidly 
enou^i.  Here,  the  derivative,  4a.'  (i)  y  also  is  a  normal  stationary  process  and, 
consequently,  assumptions  made  during  derivation  of  formula  (18.111)  are  satisfied. 

Substituting  relationship  (18.112)  into  formula  (18.111),  we  obtain 

rs_f 


(18.115) 


where  gm  is  determined  from  formula  (18.101).  Since,  in  this  case 


“« (<.  *i) — Ut  coi  (a,/ + 2w,  (/)  4.  tp,] , 


then 


fl-4n*-^-4n*Pop, 


where  Pnp—UJ/i  —  specific  signal  mean  power.  Therefore,  formula  (18.101)  provides 

*— STiv  (w-iw) 


Relative  measurement  error  may  be  determined  from  the  formula 


£»_)(•  gm » f*  4 f 

«•*  “«I  J  .  .  *w> gm* f* 

0  +  K.(l) 

Here,  af  —  measured  parameter  x(t)  variance,  i.  e., 

9 

where  x(t)  =  ^#(0  . 

It  is  easy  to  become  convinced  that  formula  (18.115)  coincides  (precise  /352 
to  designations)  with  fornula  (7.34)  if  the  latter  is  used  for  an  FM  signal. 
Actually,  for  an  FM  signal,  Em* V)  is  determined  from  formula  (7.13),  in  which 
one  should  assume  ^ x(t)  =  x^(t).  Then,  considering  (18.101),  the  result  is 


(18.117) 


(18.118) 


(18.119) 


If,  in  addition,  you  consider  that,  in  Chapter  7,  message  x(t)  was  assumed  to 
be  normalized  so  that  xM(m  =  1,  while  this  normalization  is  missing  in  this  case, 


2 

then  message  spectra  E  (f)  and  g  (f)  must  coincide  when  uu0  =  1.  Considering 

A  A 

this  comment  and  relationship  (18.119),  the  result  is  that  formulas  (18.115)  and 
(7.34)  actually  coincide  completely. 

However,  the  formulas  obtained  in  Chapter  7  based  on  the  Kotel'nikov  theory 
applied  to  a  case  of  simple  message  reproduction  and  only  one  of  the  direct  modulation 
types — frequency  modulation.  The  material  presented  in  this  section  makes  it 
possible  to  determine  potential  accuracy  for  various  types  of  indirect  modulation 
and  during  complex  message  reproduction. 

18.5  Multichannel  Reception 

The  assumption  in  all  preceding  chapters  was  that  single  signal-plus-noise 
realization  y(t)  arrives  at  receiving  device  input  and  this  realization  will  comprise 
information  only  about  single  usable  message  x(t).  However,  in  practical  cases, 
more-complex  instances  often  will  be  encountered  in  which  1  different  messages 
*(,K0  will  be  subject  to  reproduction,  i.  e.,  the  receiving  device  must 
have  an  1-channel,  rather  than  single-channel,  output.  In  addition,  this  combination 
of  messages  [or  single  message  x(t)]  may  comprise,  not  one  input  realization  y(t), 
but  a  combination  |yo,(0.  ...» 0W(O1  of  such  realizations,  which  arrive  from  m  different 
antennas  (or  from  m  primary  message  processing  devices,  if  we  are  talking  about 
system  design  of  secondary-processing  devices),  for  example.  In  this  case,  the 
receiving  device  has  m  input  channels.  Consequently,  in  the  general  case,  the 
optimum  receiving  system  being  designed  may  comprise  m  inputs  and  1  outputs. 

The  theory  of  optimum  nonlinear  filtration  presented  above  also  permits  generalization 
for  this  more-complicated  case. 

From  the  point  of  view  of  the  principle  (idea)  involved,  such  generalization 
is  not  complicated.  Actually,  the  combination  of  input  realizations  may  be  considered 
some  vector  realization 

VW  -  . *<«>(/)},  (18.120) 

while  the  combination  of  reproduced  messages—as  some  vector  message 


x~[t)  - 'ijfiMjOi  ...»  *<»>(<)}. 


(18.121) 


It  is  possible  to  consider  the  result  of  reproduction  of  these  messages  some  /353 
vector 


VB-lT*'MO . Y<0(0).  (18.122) 

It  is  possible  to  introduce  the  concept  of  generalized  loss  function 

— ►  — ► 

/  (x,  y).  ^  t  which  designates  losses  corresponding  to  each  combination  of  vectors 
x  and  y .  Here,  it  is  possible  to  demonstrate  [127]  that,  in  the  case  of  a  gener¬ 
alized  quadratic  loss  function  analog,  as  usual,  relationship  (17.46)  determines 
optimum  receiver  structure;  the  only  requirement  is  to  replace  x,  y,  and  f  in 
it  with  x,  y,  and  f ,  respectively.  However,  reduction  of  expression  (17.46) 
to  a  form  applicable  for  practical  use,  as  noted  above,  turns  out  to  be  complicated, 
even  for  the  single-channel  case  (m  s  1,  1  =  1).  It  is  natural  that  this  complexity 
rises  sharply  during  conversion  to  a  general  case  of  m  input  and  1  output  channels. 
Several  useful  results  for  this  general  case  were  obtained  in  [127]. 

In  this  book,  we  will  restrict  ourselves  to  examination  of  a  very  common, 
but  also  very  important,  case  when  1=1,  but  the  m  signal-plus-noise  input  reali¬ 
zations  have  the  following  form: 


i^1*  (0  (f.  x)  +  umt  (0.  j 

y<3>  (()="«  (t.  *)+«,,„(/). 


(18.123) 


y«"l  (0  -  ucm  (f,  jc)  +  umm  «).  ) 


where  uci(t,  x)(l  ®  1,  .  .  .,  m)  —  precisely-known  signals  (except  for  message 
x(t),  while  u«i(t)(i  s  1,  .  .  .,  m)  —  independent  normal  white  noise  with  spectral 
densities  NQ1,  NQ2,  ...»  N^,  respectively. 

This  case  differs  from  those  in  preceding  sections  of  the  book  only  in  that 
there  is  a  requirement  to  examine  an  (18.120)-type  vector  realization,  rather 
than  8 ingle  realization  y(t).  Therefore,  formally,  all  results  obtained  above 
will  remain  valid  if  y(t)  replaces  y(t)  in  them. 


In  particular,  the  optimum  meaaage  x(t)  reproduction  system  structure  depicted 
in  Figure  18.1  takes  the  form  sheen  in  Figure  18.12.  The  only  difference  is  that 


Vmtt / 


(b)  *W] 


- -  Aue  ^ 

jWT  1—1 


Figure  18.12.  (a)  —  DIS  [Discriminator]; 

(b)  —  F^  [Filter]. 


a  combination  lyu,(/) . |f4*",(OI  of  realizations,  rather  than  one  realization  y(t), 

arrives  at  discriminator  input.  All  formulas  determining  this  system's  structure 
remain  the  same  (aa  in  a  case  where  i  :  1  as  well),  with  the  exception  that,  in 
expressions  (18.6>f(18.8),  y  should  replace  y,  i.  e.,  one  should  assume 


(18.124) 
/354 

(18.125) 


where  function  Q(y,  x,  t)  is  determined  from  the  relationship 


$  Q(y,x,  t)dt  =  -\nPs(y). 


(18.126) 


Hence,  it  follows  that  it  suffices  to  compute  the  value  of  likelihood  function 
Px(y)  in  order  to  consider  the  special  features  of  a  multichannel  case  (m  >  1) 
compared  to  a  single-channel  case  (ms  1). 

Let  the  laws  of  noise  distribution  (multidimensional  probability  densities) 
um  (0,  mm  (0  included  in  expression  (18.123)  equBl  Wt  («B  ,),  Wm(umm)  , 
respectively. 


Then,  considering  that  this  noise  is  interdependent,  while  signals  uc^(t,  *)» 


„  u  (t,  x)  for  a  given  x  are  precisely  known,  we  will  obtain 
cm 


p*G)  =  pm" . y<"))  = 

«c)  -  o- 


(18.27) 


But,  law  of  distribution  HM“mi)of  normal  white  noise  uml(t)  has  the  form 


(18.128) 


where  a  —  some  constant.  Therefore 


I T, (^—Uci)  - aexp  J— fPu> (0— «*('•  *)N*j 


(18.129) 


and  expression  (18.127)  takes  the  form 


{<  m 

—  f  2  —  1^,(1,  x)\*dt  . 

,iu  1-1 "« 


(18.130) 


Here,  from  formula  (18.126)  we  have 


f  i 

f  Q(y,x,t)dt~- lna+  \  2  jT1^0 (<>— “ci(*» 

t-n 


(18.131) 


Since  only  the  second  derivative  of  function  Q(y,  x,  t)  with  respect  to  x 
will  be  included  in  expression  (18.125),  then  it  is  possible  to  disregard  the 
term  In  a  not  depending  on  x  in  (18.131)  and  to  assume 


m 

Q(y.  x.  0  * 2  3T l*10 &  *)p. 


(18.132) 


Following  substitution  of  this  expression  into  formula  (18.125)  and  appropriate 
transforms,  we  obtain  (considering  relationship  (18.59)  and  the  smallness  of  magnitude 


where 


(18.133) 


ax  .  * 


When  m  s  1,  formula  (18.133),  as  could  be  expected,  coincides  with  corresponding 
formula  (18.31)  for  a  single-channel  system. 

Thus,  all  conclusions  drawn  for  a  single-channel  index  remain  valid  (within 
the  framework  of  the  same  assumptions)  for  a  multichannel  index  as  well,  the  only 
difference  being  that  potential  spectral  density  gmt  is  determined  from  formula 
(18.133),  rather  than  from  formula  (18.31). 

The  magnitude  of  the  root-mean-square  measurement  error  is  computed  with 
the  aid  of  the  equivalent  circuit  depicted  in  Figure  18.3c  or  from  formula  (18.13). 

As  noted  in  §  18.2,  in  a  number  of  cases,  (in  automatic  angular  coordinate 
target  tracking  systems,  for  example),  the  oscillation  at  measurement  system  input 
has  the  form 


1/(0 =«.(/.  e)+«m  (0. 

where 

8</)=jc(0-Y(0 


(18.134) 


—  error  signal  equalling  true  parameter  value  deviation  from  the  measured  value. 

Here,  all  formulas  obtained  for  potential  single-channel  measurement  accuracy 
remain  valid  if  x(t)  is  replaced  by  6  (t)  when  gat  is  computed.  It  is  possible 
analogously  to  demonstrate  that,  in  multichannel  indices,  if  oscillations  /356 
at  measuring  channel  input,  as  opposed  to  expression  (18.123),  have  the  form 


"'d  ■ 


■  “v  r»r  i*v  'v;* . * "r*  \ 

.  «.*.  «*.>*  ».  W-.*.  »  ifcV^hw 


*'»  (0=«M«.  e)+U(111(0, 


y<<">  (/)=«„„,(/,  e)  +  umm  (t). 


(18.135) 


then  the  above  results  remain  valid  if  €  replaces  x  in  expression  (18.133),  i.  e., 
if  you  assume 


where 


(18.136) 


(18.137) 


In  the  particular  case  when  the  spectral  densities  of  the  noise  at  all  measure¬ 
ment  channels  are  identical,  i.  e., 

Nn  ~  5=8  •••  ~  = 

formula  (18.136)  takes  the  form 

a  -  N* 

Suit  —  ~ —  • 

s  fl«  (18.138) 

lm  I 

In  an  even  more  particular  case  when 

—  ... ^  Bm r=  B, 

the  result  is 


where 


8 nit " 


mB 


s-pFT- 


(18.139) 
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equisiqnal  zone,  voltages  of  the  signals  at  the  input  of  measuring  channels  equal, 
respectively 


y(,,(Q-aei(f.  r)  +  um,  (/) 
(0  =  wc  i  O  4  ii m  i  (0* 


(18.140) 


where 


«ei(*.0=*tl4-f«  (01  *  CO . 


(18.141) 


*  (t)  —  angular  deviation  of  the  target  the  bearing  is  taken  on  from  the  axis 
of  the  equisignal  zone;  i»  —  coefficient  depending  on  the  shape  of  the  radiation 
pattern  of  the  antennas  and  on  the  magnitude  of  the  angular  deviation  of  the  main 
pattern  maxima  from  the  axis  of  the  equisignal  zone;  b(t)  —  signal  carrier  oscil¬ 
lation. 

Assuming  the  noise  intensity  of  both  channels  is  identical,  in  accordance 
with  formulas  (18.137)  and  (18.138)  we  obtain 


where  —  specific  mean  power  of  the  signal  b(t)  carrier  oscillation. 


(18.142) 


Given  an  m-channel  direction  finder  with  a  phased  array  antenna  system,  it 

will  comprise  m  equidistant  antenna  elements  Ht,  Ht . Hm/r  /f_,,  h . H_ml 3  (Figure 

18.13),  whose  voltages  are  supplied  correspondingly  to  m  individual  amplifier 
stages  and  then  to  the  processing  system,  which  extracts  error  signal  M4  from 
the  combination  of  received  oscillations,  i.  e.,  the  angular  deviation  of  the 
target  the  bearing  is  taken  on  u  from  axis  z  of  the  system  equisignal  zone. 

Here,  the  combination  of  oscillations  at  measurement  system  input  may  be  written 
in  the  form 
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Figure  18.13.  (a)  —  I  [Antenna  elements];  (b)  —  Ts  [Target]; 

(c)  —  To  amplifier  stages. 


0(,,(O  =  “ei(<.e)  +  «mt<<). 


(<)  =  ue  m/i  (<.*)  +  um  m/a«). 

(<.«)+“»  -I  (0. 


y<  "•/»»(/)=„*  _*/9«,  *)+«„ 

where  um±k  —  noise  at  +  k-th  channel  input. 


(18.143) 


In  future,  we  will  assume  for  simplicity  that  all  antenna  elements  and  individual 
amplifier  stages  are  identical.  Then,  spectral  densities  of  the  noise  at  the 


input  of  each  channel  are  identical  and  equal  Nq,  while  signal  voltages,  given 


slight  angular  deviations  €  ,  have  the  form 


“e±i  <f.e).t/«)co.I*+W(0±AM0l. 
ue±a("<,  (fj  cos  [to/  Vf  (0  ±  —  i)  ht  (01 . 

«e±m/V(<'.  «)•={/(<)  cos'lil  +f '  )  ± 


(18.144) 


where  h> 


nd 
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1  * 
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Expressions  (18.144)  should  be  substituted  into  formulas  (18.137)  and  (18.138) 
to  determine  potential  measurement  accuracy.  Here,  the  result  is 
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(18.145) 
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Considering  that 


Sfc  =  [£W1e,]Z=s(2A_|)t^-T- 


(18.146) 


is  specific  mean  signal  power  at  the  output  of  each  emitter,  from  (18.145)  we 
will  obtain 

8n»—  m/a 

2A»  Pc p  2  <2‘-  •)* 


m/a 

(»-!)• 


m*— m 
6 


therefore,  finally  we  obtain 


A*  Pep  («•—«) 


(18.147) 


18.6  Concluding  Comments 

Five  basic  assumptions  forming  the  foundation  of  the  theory  of  optimum  nonlinear 
filtration  were  pointed  out  in  the  beginning  of  this  chapter  (§  18.1).  We  will 
examine  how  much  and  in  what  sense  these  assumptions  may  reduce  this  theory's 
field  of  use. 

The  first  assumption  is  that  the  law  of  message  x(t)  distribution  is  normal. 


If  this  law  differs  significantly  from  the  norm,  then  this  theory  in  the 
general  case  will  be  imprecise;  in  particular,  nonlinear  smoothing  networks  (instead 
of  linear  filter  0^)  may  be  required.  However,  as  pointed  out  above,  if,  in 
accordance  with  problem  conditions,  smoothing  networks  fall  into  the  class  of 
linear  networks,  then  all  basic  results  of  the  theory  are  valid  for  an  abnormal 


law  of  message  distribution  (if  the  remaining  assumptions,  less  the  first  one, 
remain  valid). 

In  addition,  as  will  be  demonstrated  in  the  next  chapter,  the  higher  the 
message  fidelity  requirements,  the  less  message  characteristics  impact  upon  message 
fidelity.  Therefore,  one  may  consider  that  the  first  assumption  does  not  /359 
overly  restrict  this  theory's  field  of  use. 

The  second  assumption  is  that  message  x(t)  changes  slowly  in  comparison  to 
additive  noise  um  (0  and  compared  to  parasitic  signal  parameters  changing  over 
time,  i.  e.,  compared  to  signal  fluctuations. 

The  first  part  of  this  assumption  [x( t)  slowness  compared  to  «m(0]  is  satisfied 
in  a  majority  of  practical  cases.  The  slowness  of  x(t)  compared  with  signal  fluc¬ 
tuations  (if  they  exist)  occurs  in  far  from  all  practical  cases. 

For  example,  in  radar  problems  when  target  motion  parameters  are  messages, 
the  correlation  time  of  these  messages  usually  is  measured  in  unities  of  seconds 
or  more,  while  the  correlation  time  of  signal  fluctuations  are  tenths  of  seconds 
or  less.  Therefore,  the  second  assumption  usually  is  satisfied  to  a  lesser  degree 
than  is  the  first  assumption.  However,  in  a  case  of  reception  of  voice  or,  moreover, 
television  messages,  the  rate  of  message  change  is  not  less,  but  rather  is  signi¬ 
ficantly  greater,  than  the  signal  fluctuation  rate  (fading).  Therefore,  this 
theory  may  be  considered  valid  for  such  messages  only  in  the  assumption  that  signal 
fluctuations  are  completely  absent.  Such  an  assumption  contradicts  reality,  but 
makes  it  possible  to  obtain  the  potential  (theoreticallv-permissible  threshold) 
message  fidelity  value  (since  the  fact  that  a  real  signal  has  fluctuations  may 
only  lower  this  fidelity).  Consequently,  for  "fast"  messages,  this  theory  makes 
it  impossible  to  f .  id  optimum  receiver  structure  and  applies  only  to  potential 
message  fidelity  estimation. 

The  third  assumption  (concerning  high  message  fidelity)  in  a  majority  of 
practical  cases  may  be  considered  satisfied,  at  least  in  the  first  approximation. 

The  fourth  assumption  is  that  modulation  of  signal  ue  ( l‘>  *>  «i*  •••»  «m)by  message 
x(t)  is  direct. 


This  assumption  is  invalid  for  several  important  cases,  frequency  modulation 
in  particular.  However,  as  shown  in  §  18.4,  it  is  possible  to  generalize  the 
theory  for  cases  of  indirect  modulation  as  well  when  determining  potential  message 
fidelity. 

The  fifth  assumption  (concerning  the  stationary  nature  of  signal  fluctuations) 
in  a  majority  of  practical  problems  may  be  considered  valid,  at  least  in  the  first 
approximation.  In  addition,  the  Gauss  approximation  method  remains  valid  also 
when  this  assumption  is  removed,  although  computations  and  results  here  are  com¬ 
plicated  considerably  (see  [127],  for  example). 

It  follows  from  what  has  been  said  >that,  although  the  assumptions  made  con¬ 
siderably  reduce  the  generality  of  the  theory  examined  in  this  chapter,  its  domain 
remains  very  broad,  especially  when  determining  potential  message  fidelity. 
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CHAPTER  NINETEEN 


/360 


IMPACT  OF  OPTIMIZATIONS  AND  A  PRIORI  MESSAGE  DISTRIBUTIONS  ON  OPTIMUM  RECEIVER 
STRUCTURE  AND  PROPERTIES 

19.1  General  Comments 

In  accordance  with  the  theory  presented  above,  finding  the  structure  of  the 
optimum  receiver  and  determining  its  noise  immunity  require  precise  knowledge 
of  the  a  priori  signal  and  noise  distribution*  and  selection  of  the  corresponding 
optimization,  i.  e.,  the  loss  function  /  (x,  y)  type.  However,  significant  dif¬ 
ficulties  arise  here  since  a  priori  distributions  usually  are  not  precisely  known 
and  unambiguous  selection  of  the  specific  loss  function  type  is  not  always  possible. 

Initially,  we  will  examine  the  a  priori  signal  and  noise  distribution  problem. 

The  a  priori  distribution  of  the  basic  noise  type—  intemel  receiver  noise, 
is  precisely  known.  The  a  priori  distribution  of  most  other  noise  of  natural 
origin  (atmospherics,  background  reflections,  and  so  on)  also  usually  is  known 
or  may  be  approximated  from  preliminary  perennial  research.  On  the  other  hand, 


*A  priori  data  may  be  required  also  for  selection  of  anticipated  signal  strength 
and  of  other  characteristic  problem  parameters. 
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the  a  priori  distribution  of  organized  noise  turns  out  to  be  unknown  completely 
or  to  a  sign if cant  degree. 

A  priori  signal  distribution  is  determined  from  usable  message  distribution 
P(x)  and  parasitic  parameter  distribution  P  (o„  ....  om) .  (Usually,  one  may  assume 
that  message  x  and  the  parasitic  parameters  mutually  are  statistically  independent). 

In  most  cases,  distribution  P  (oet,  ....  am)  is  know,  at  least  approximately. 

For  example,  it  is  known  that,  when  receiving  a  signal  reflected  from  an  aircraft, 
the  initial  phase  has  a  uniform  distribution,  while  it  is  possible  in  the  first 
approximation  to  assume  that  the  amplitude  is  distributed  in  accordance  with 
Rayleigh's  law.  Here,  by  virtue  of  the  accumulation  of  experimental  data,  the 
law  of  amplitude  distribution  will  become  known  with  ever-greater  accuracy. 

A  priori  message  P(x)  distribution  in  several  technical  cases  is  known  or 
may  be  determined  with  sufficient  accuracy  from  experimental  material,  which  already 
is  available  or  which  may  be  accumulated  in  the  near  future.  This  occurs,  for 
instance,  in  civilian  radio  communications  and  during  radio  and  TV  broadcasts. 
However,  in  many  cases,  the  a  priori  message  distribution  is  unknown  and,  in 
principle,  may  not  be  found  due  to  the  absence  of  preliminary  similar  situations, 
know-how  from  which  could  be  put  to  use.  This  happens,  for  example,  in  /361 
military  radar  and  during  mensuration  involving  the  study  of  new  physical  phenomena. 

Thus,  in  a  majority  of  cases,  a  priori  signal  and  noise  distributions  are 
not  precisely  known,  with  this  applying  in  particular  to  a  priori  message  distri¬ 
butions.  However,  it  turns  out  that,  in  many  important  cases,  optimum  receiver 
structure  and  properties  essentially  will  not  depend  on  a  priori  distributions 
of  messages  and  certain  parasitic  signal  parameters.  Thus,  for  instance,  it  was 
pointed  out  above  many  times  that,  given  noise  in  the  form  of  additive  normal 
white  noise  and  a  high  signal-to-noise  ratio,  the  noise  immunity  of  the  optimum 
receiver,  given  a  signal  with  an  equiprobable  random  initial  phase,  turns  out 
to  be  identical  to  that  for  a  precisely-known  signal.  This  signifies  that,  under 
the  conditions  stipulated,  the  law  of  signal  initial  phase  <p  distribution  does 
not  impact  on  optimum  receiver  properties. 


It  will  be  demonstrated  below  that,  under  these  conditions  (given  noise  in 


the  form  of  additive  normal  white  noise  and  high  signal-to-noise  ratio),  the  type 
of  a  priori  message  P(x)  distribution  also  essentially  does  not  impact  upon  optimum 
receiver  structure  and  properties,  at  least  in  a  case  of  individual  analog  message 
value  reception. 

We  now  will  examine  the  problem  of  selecting  a  specific  type  of  loss  function 
I  (■*.  Y)  .  It  follows  from  material  in  the  preceding  chapter  that,  when  receiving 
discrete  messages,  loss  function  selection  boil?,  down  to  selection  of  the  weights 
of  various  error  types  (false  alarm  and  signal  miss  weights,  for  example).  In 
several  cases,  selection  of  these  weights  does  not  present  difficulties.  T^us, 
for  instance,  during  transmission  of  an  alphabet  with  equiprobable  letters  lacking 
a  statistical  link  among  them,  it  usually  is  possible  to  assume  that  the  weight 
of  all  errors  is  identical,  i.  e. ,  to  consider  that  it  is  identically-dangerous 
to  receive  any  other  letter  of  the  alphabet  instead  of  a  given  letter.  However, 
in  many  cases,  the  physical  crux  of  the  problem  does  not  make  it  possible  to  select 
error  weight  so  that  there  is  confidence  that  this  weight  selection  is  the  best 
or  the  only  possible  one. 

In  a  case  of  simple  reproduction  of  analog  messages,  as  already  noted  above, 
a  quadratic  loss  function  of  the  following  type  usually  is  selected  as  the  loss 
function 

Hx,  v)  »(?-*)». 

However,  in  many  cases,  it  is  far  from  evident  that  such  a  loss  function 
is  the  best  or  the  only  correct  one.  For  instance,  the  advantages  or  disadvantages 
of  this  loss  function  compared  to  a  function  of  the  type  /  (*,  y)  «*  |y  —  *|  and  so 
on  may  be  unclear. 

Thus,  there  is  considerable  ambiguity  in  selection  of  the  type  of  a  priori 
di8tribution8  and  loss  functions  in  a  majority  of  cases  during  the  design  of  an 
optimum  receiver  and  evaluation  of  its  quality.  It  also  is  evident  that  selection 
of  a  particular  loss  function  type  in  turn  is  based  on  certain  a  priori  data. 

Thus,  for  example,  if  it  is  known  a  priori  that  an  alphabet  is  the  message,  /362 
then  this  provides  the  basis  to  select  identical  weights  for  various  errors. 

If  it  is  known  that  music  or  speech  is  the  message,  a  quadratic  loss  function 


is  advisable  in  both  cases.  However,  a  priori  data  never  may  be  comprehensive 
and  a  considerable  degree  of  ambiguity  (randomness)  remains  in  selection  of  the 
loss  function  type,  even  in  the  best  cases.  Therefore,  it  is  very  important  to 
clarify  the  degree  to  which  this  ambiguity  may  impact  upon  optimum  receiver  structure 
and  properties.  Study  of  the  problem  is  one  of  the  basic  tasks  of  the  contemporary 
theory  of  optimum  reception  methods. 

It  is  shewn  in  succeeding  sections  that,  given  noise  in  the  form  of  additive 
normal  white  noise  (and  not  only  that  type)  and  high  requirements  for  message 
fidelity  (reliability),  the  structure  of  the  optimum  receiver  and  its  properties 
essentially  will  not  depend  on  the  type  of  a  priori  message  distribution  and  on 
the  optimization  selected  for  a  broad  class  of  these  distributions  and  criteria. 
Therefore,  in  the  quest  for  the  optimum  receiver  and  study  of  its  properties, 
it  is  permissible  in  many  cases  to  begin  with  the  simplest  (uniform)  law  of  message 
distribution  and  from  one  of  the  simplest  optimizations. 

19.2  Certain  Properties  of  Distribution  P  (x)  During  Reception  of  Individual 
Analog  Message  Values  y 

The  optimum  receiver  must  reproduce  message  x  in  the  best-possible  manner 
based  on  analysis  of  signal-plus-noise  realization  y(t).  For  a  given  y(t),  all 
usable  information  on  message  x  will  be  contained  in  distribution  Py(x) ;  therefore, 
optimum  receiver  structure  and  properties  are  determined  primarily  by  the  distribution 
Py(x)  for  any  optimization. 

Thus,  for  example,  in  the  case  of  the  maximum  inverse  probability  criterion 
[condition  (17.35)],  the  task  of  the  optimum  receiver  is  simply  to  find  the  dis¬ 
tribution  Py(x)  maximum  (with  respect  to  x)  and,  using  the  generalized  minimum 
average  risk  criterion  [condition  (17.32)],  receiver  structure  and  properties 
also  are  determined  primarily  by  distribution  Py(x)  type. 

Therefore,  we  must  first  explain  basic  distribution  Py(x)  properties  in  order 
to  analyze  the  influence  of  a  priori  distribution  P(x)  and  loss  function  /  (jc,  v) 


i 


We  will  examine  these  properties  relative  to  a  case  of  simple  reproduction 
of  individual  analog  message  values,  when  distribution  P  (x)  is  a  unidimensional 
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probability  density  and  therefore  easily  is  depicted  graphically  (Figure  19.1). 
Here,  we  will  assume,  as  was  the  case  in  Parts  I  and  II  of  the  book,  that  the 
message  is  normalized  so  that 


(19.1) 


This  normalization  does  not  disrupt  the  generality  of  the  analysis,  /363 
while,  for  any  actual  message  T  change  (finite)  bounds  tmm~  TMam  »  it  is  possible, 
having  assumed 


where 


T  =  *•  +  *!*. 


2  ’  1“  2 


(19.2) 


to  obtain  normalized  message  x  change  bounds  equalling  +  1.  Here,  T ’  and  T ^ 
are  known  magnitudes  and  transform  (19.2)  denotes  only  coordinate  origin  displacement 
and  introduction  of  a  specfic  scale  with  respect  to  the  X-axis. 


The  type  of  curve  Py(x)  for  different  realizations  y(t)  (y  a  y^  and  y  ^  y^ 
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in  Figure  19. 1,  for  example),  in  the  general  case  may  vary,  while  the  area  is 
determined  by  normality  condition 

i 

j  P,(x)dx=l,  (19.3) 

i.  e. ,  always  equals  uiity. 

We  will  assume  that,  when  there  is  no  noise,  message  x  will  be  reproduced 
precisely,  i.  e. ,  message  reproduction  errors  are  linked  exclusively  with  noise 
activity.  This  signifies,  in  particular,  that,  if  the  signal  has  parasitic  random 
parameters  (a,,  ....  am)  ,  then  the  nature  of  these  parameters  is  such  that  it  does 
not  hinder,  in  principle,  accurate  message  mensuration  when  noise  is  absent. 

Such  parasitic  parameters  in  future  for  brevity  will  be  called  safe  parasitic 
parameters. 

For  instance,  let 

«e(/)»at(l+*)cos(»/+«p).  (19.4) 

where  <p  —  equally-probable  initial  phase;  while  aQ  and  (*/  —  precisely-known 
parameters. 

Evidently,  in  this  case,  parasitic  parameter  q>  is  safe  since,  when  there 
is  no  noise,  it  is  possible  through  amplitude  detection  to  extract  envelope 
aQ(l  +  x)  and,  consequently,  in  principle  to  measure  message  x  accurately. 

Since  the  message  will  be  reproduced  accurately  when  noise  is  absent,  dis¬ 
tribution  Py(x)  here  has  the  form  of  delta-function  d(x  —  *■)»  where  xM  —  true 
message  value  (Figure  19.1b). 

Since  reproduction  error  in  this  example  equals  zero,  we  will  call  this  /364 
a  case  of  zero  error  and  designate* 

8»-0. 

*In  expression  (19.5),  6  denotes  error,  rather  than  a  delta-function. 
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(19.5) 


Consequently,  distribution  Py(x)  has  the  form  of  a  delta-function  in  the 
case  of  zero  error. 

In  a  real  case,  i.  e.,  given  fluctuating  noise,  the  error  does  not  equal 
zero.  However,  given  known  conditions,  it  is  sufficiently  slight,  i.  e., 

5*«  1.  (19.6) 

It  follows  from  what  has  been  stated  above  that,  when  6* -*•<)  ,  distribution 
Py(x)  will  strive  towards  a  delta- function.  Therefore,  given  a  finite  but  slight- 
enough  error,  this  distribution  must  have  a  form  close  to  a  delta-function  (Figure 
19.1c),  i.  e. ,  curve  Py(x)  in  this  case  has  a  significant  value  only  within  very- 
narrow  interval  Axy  around  the  most-probable  value  xvn  .  Here,  index  y  denotes 
that  magnitudes  Axy  and  xw  in  the  general  case  will  depend  on  realization  y(t) 
type.  However,  due  to  the  error  decrease,  magnitude  Axy  asymptotically  will  strive 
towards  zero  essentially  for  all  possible  realizations  y(t)  (since,  otherwise, 
error  &  would  not  strive  towards  zero). 

Since  error  5*  asymptotically  will  strive  towards  zero  given  an  unlimited 
rise  in  signal-to-noise  ratio  q,  instead  of  (19.5)  and  (19.6),  it  is  possible 
to  assume,  respectively 

(19.7) 

and 

*»*•  (19.8) 

In  the  first  case  (where  q  — *oo)f  distribution  Py(x)  is  converted  into  a 
delta- function,  while,  in  the  second  (q  3>  1),  it  has  significant  values  only 
within  very  slight  interval  Axy. 

It  is  possible  to  use  the  results  presented  in  Parts  II  and  III  for  noise 
in  the  form  of  additive  normal  white  noise,  valid  for  a  high  signal-to-noise  ratio 
(q  ^  1),  to  illustrate  these  general  postulations  about  the  function  P  (x)  type. 


For  a  precisely-known  signal,  distribution  P  (x)  is  described  by  formula 

(6.12):  y 


p«(*)=  |/  ~e~b(*~xv  ")*, 


(6.12) 


while  the  mean-square  error  equals 


*-5- 


(6.17) 


where 


In  a  case  of  amplitude  modulation 


b  _  ^9*  —  m». 


(6.22) 


For  frequency  or  pulse-position  modulation,  as  follows  from  (6.17),  (6.26b), 
and  (6.40),  the  result  is 


12  N,  12 


(19.9) 


It  follows  from  these  relationships  that,  where  5*-*-0  or  q — ••  co,  distribution 
Py(x)  actually  will  strive  towards  a  delta-function.  In  addition,  in  the  cases 

t  |* 

Jlvl/li — — 

-f  Xy,»  0  Xqtn  f  'X 

(a)  .  • 


•t  *9,*  9  'p  X 

G>) 

Figure  19.2 


examined,  distribution  Py(x)  has  several  additional  special  features,  which  are 
important  for  further  analysis: 

a)  distribution  Py(x)  is  a  single-humped  curve  symmetrical  relative  to 
most-probable  value  xHn  (Figure  19.2a); 

b)  the  type  of  curve  Py(x)  for  various  realizations  y(t)  remains  unchanged. 

When  y  changes,  only  a  displacement  along  the  X-axis  of  the  position  of  the  curve's 
maximum  caused  by  the  dependence  of  y  on  most-probable  value  xun  will  occur  [in 

the  cases  examined,  evidently  xn  =»  ,  while  true  value  xn  may  be  anything  ranging 

from  —1  to  1], 

Postulations  a)  and  b)  are  valid  for  all  realizations  y,  with  the  exception 
of  those  which  correspond  to  values  *n  ,  very  close  to  +  1. 

If  xra  is  close  to  +  (Figure  19.2b),  distribution  Py(x)  will  become  asymmetrical 
and  will  depend  on  y.  Such  a  distribution  Py(x)  distortion  occurs  because  the 
probability  of  message  x  values  falling  into  the  +  1  range  must  equal  zero.  However, 
if  q  >1  (or  error  J1  is  very  slight),  then  curve  Py(x)  has  a  very  narrow  peak 
and  its  significant  distortion  may  occur  only  for  xym  values  very  close  to  the 
boundaries.  The  probability  of  such  values  is  very  slight.  Actually,  in  the 
case  being  examined  of  very-slight  errors  for  the  overwhelming  majority  of  reali¬ 
zations  y(t),  most-probable  message  value  xn  must  be  very  close  to  its  true  value 
xm  .  Therefore,  if  a  priori  distribution  P(x)  does  not  have  sharp  peaks  around 
the  boundary  values  (x  %  +  1),  then  the  probability  of  the  appearance  of  message 
value  xn  very  close  to  one  of  the  boundary  values  is  very  slight.  Consequently, 
the  probability  that  most-probable  mesaage  value  xn  will  turn  out  to  be  close 
to  one  of  the  boundaries  also  is  very  slight. 

Thus,  with  the  exception  of  special  cases  [in  which  a  priori  distribution 
P(x)  has  sharp  maxima  around  boundary  values  (x  s  +  1)],  it  is  possible  to  assume 
that  properties  a)  and  b)  occur,  i.  e.,  distribution  Py(x)  is  a  single-humped  /366 
curve  symmetrical  relative  to  mo8t-probable  value  xn  and,  this  curve,  (without 
distortions)  will  displace  along  the  X-axis  only  when  y  changes  (Figure  19.2a). 

Kotel'nikov  obtained  formulas  (6.12)  and  (6.17)  and  drew  conclusions  from 
them  for  precisely-known  signals.  However,  it  was  demonstrated  in  Part  III  that, 


given  a  high  signal-to-noise  ratio,  they  remain  valid  also  for  random  initial 
phase  signals. 


So,  distribution  Py(x)  is  a  symmetrical  single-humped  curve  essentially  for 
all  y,  given  additive  noise  in  the  form  of  normal  white  noise  and  a  high  signal- 
to-noise  ratio  (q  >1). 


At  first  glance,  it  may  be  demonstrated  that,  since,  where  q  — ►  oo, 
distribution  Py(x)  will  strive  towards  a  delta-function,  when  q  increases,  it 
must  strive  towards  a  single-humped  symmetrical  curve  for  any  fluctuating  noise, 
but  not  only  in  the  case  of  normal  white  noise.  However,  in  actuality,  given 
some  forms  of  fluctuating  noise,  distribution  Py(x)  may  remain  asymmetrical  or 
not  single-humped  for  a  signal-to-noise  ratio  as  high  as  desired  (but  finite). 


The  proof  of  this  postulation  was  provided  in  [125].  It  was  demonstrated 
here  that,  given  a  high  signal-to-noise  ratio,  distribution  Py(x)  may  be  (essentially 
for  all  y)  a  single-humped  curve  symmeterical  relative  to  the  most-probable  value, 
not  only  for  noise  in  the  form  of  normal  white  noise,  but  for  some  other  types 
of  noise  as  well. 

These  conclusions  are  of  great  significance  when  analyzing  the  influence 
of  optimizations,  as  will  be  shown  in  the  next  section. 

19.3  Impact  of  A  Priori  Message  Distributions  and  Optimizations  During  Reception 
of  Individual  Analog  Message  Values 


1.  Impact  of  A  Priori  Message  Py(x)  Distribution 

I  ' 

In  the  general  case,  distribution  Py(x)  will  depend  on  a  priori  message  P(x) 
distribution  since 

(*)"*»  P  (•*)  P *  (y),  (19.10) 

where  ky^  —  coefficient  determined  from  normality  condition  /367 


J  P,(x)dx=  I. 

-i 
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(19.11) 


Let  the  signal-to-noise  ratio  be  so  high  that  distribution  P^(x)  has  a  signi¬ 
ficant  value  only  within  the  range  of  a  very-narrow  region  with  width  Ax^,  (Figure 
19.1c).  We  will  clarify  what  conditions  distribution  P(x)  must  meet  so  its  form 
essentially  will  not  impact  upon  the  shape  of  curve  P^(x). 

In  this  case,  likelihood  function  P^Cy >  in  accordance  with  (19.10)  must  have, 
accurate  to  some  constant  factor,  a  shape  exactly  like  that  of  Py(x).  Therefore, 


Figure  19.3 


function  Py(x)  depicted  in  Figure  19.1c  in  this  case  thus  will  be  similar  in  shape 
to  likelihood  function  Px(y)  depicted  in  Figure  19.3.  A  priori  distribution  P(x) 
is  shown  by  the  dotted  line  in  this  figure. 

It  is  easy  to  become  convinced  from  comparison  of  Figure  19.1c  and  19.3  and 
considering  relationship  (19.10)  that  curve  P^(x)  shape  will  not  depend  on  the 
distribution  P(x)  type  if  function  P(x)  is  sufficiently-smooth  in  the  sense  that, 

for  any  functions  P(x),  condition  +  e  is  satisfied  for  all  x 

ranging  from  —1  to  1,  (19.12),  where  €  —  positive  magnitude  significantly  less 
than  unity,  such  as  €  s  0.1. 

In  other  words,  any  possible  a  priori  distribution  must  be  so  smooth  t^at, 
in  very-slight  interval  Axy,  function  P(x)  changes  slightly,  not  more  than  ^  10S», 
for  example. 

When  the  signal-to-noise  ratio  rises,  magnitude  Ax^  included  in  condition 
(19.12)  asymptotically  will  strive  towards  zero.  Therefore,  the  greater  the 
signal-to-noise  ratio,  the  broader  the  class  of  a  priori  distributions  P(x)  for 


A 


which  this  condition  is  met.  Consequently,  given  a  sufficiently-high  signal-to-noise 
ratio,  function  P^(x),  and,  hence,  optimum  receiver  structure  and  properties, 
will  not  depend  on  the  a  priori  distribution  P(x)  type  for  a  very-broad  class 
of  these  distributions. 

But,  the  greater  the  minimum  signal-to-noise  ratio  value,  the  higher  the 
message  fidelity  requirements.  Therefore,  it  is  possible  to  formulate  the  afore¬ 
mentioned  conclusion  also  in  the  following  manner: 

Given  sufficiently-high  message  fidelity  requirements  (slight  8*  ),  optimum 
receiver  structure  and  its  properties  will  not  depend  on  the  type  of  a  priori  /368 
message  distribution  for  a  very-broad  class  of  these  distributions.  Hence,  it 
follows  that,  given  the  aforementioned  conditions,  during  optimum  receiver  design 
and  analysis  of  its  properties,  it  is  possible  to  start  from  the  simplest  a  priori 
message  distribution  type,  i.  e. ,  to  assume 

P(x)  s  const. 

2.  Optimization  Impact 

We  will  demonstrate  that,  given  high  message  fidelity  requirements  (for  a 
high  signal-to-noise  ratio),  a  broad  class  of  optimizations  exists,  which  leads 
to  identical  optimum  receiver  structure  and  properties. 

For  proof,  we  will  begin  from  the  generalized  statistical  criterion  presented 
in  the  preceding  chapter— the  minimum  average  risk  R  criterion: 

Average  risk  R  is  determined  from  formulas  (17.30)  and  (17.31),  namely 


where 


P(y)Rrdy, 

"  M 


R„  =  f  P„  (*)/(*>  F(y)]dx- 

X, 


(17.30) 


(17.31) 


As  indicated  above,  finding  the  optimum  receiver  structure  will  boil  down 
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from  the  mathematical  point  of  view  to  finding  that  type  of  operator  I  (y)  in 
which  magnitude  R  is  minimum.  Various  optimizations  correspond  here  to  different 
loss  function  /  (x,  y)  types. 

We  will  begin  with  analysis  of  expression  (17.31).  We  will  assume  that, 
for  any  y,  function  Py(x)  is  a  single-humped  curve  symmetrical  relative  to  the 
most-probable  value. 

As  was  demonstrated  above,  such  an  assumption  is  valid  for  a  high  signal-to-noise 
ratio  for  normal  white  noise  and  some  other  noise  types.  On  the  other  hand,  it 
is  possible  to  show  [125]  that,  for  some  noise  types,  curve  Py(x)  remains  signficantly 
asymmetrical  or  not  single-humped  for  a  signal-to-noise  ratio  as  high  as  desired 
(but  finite).  Thus,  our  assumption  that  distribution  Py(x)  is,  for  all  y,  a 
single-humped  curve  symmetrical  relative  to  xn  is  valid  (given  a  high  signal-to-noise 
ratio)  not  always,  but,  in  any  case,  valid  for  normal  white  noise  and  not  only 
for  that. 

In  future,  we  will  consider  this  assumption  valid. 

Let  the  receiver  be  built  in  accordance  with  the  maximum  inverse  probability 
density  criterion,  i.  e. ,  it  supplies  always  that  value  x  which  is  the  most  /369 
probable  for  a  given  y.  This  signifies  that,  in  such  a  receiver,  this  always 
is  the  case 

(y)  —  ir 


Conditional  risk  R  in  such  a  receiver  equals 


R,  -  J  P,  (^)  /  (*,  x9  „)  dx. 


Ue  will  explain  which  loss  function  l  (*,  V)  types  apply  for  this  receiver 
operating  principle  to  be  optimum,  i.  e.,  provides  the  minimum  Ry  value  for  any  y. 


Function  /  (*,  y)  type  will  depend  on  the  optimization  selected.  However, 
for  rational  criteria,  this  function  usually  has  the  following  general  properties: 


This  denotes  thst  any  receiver  error  is  considered  a  loss  (deficit). 


2.  I(x,  x)  =  0  (19.14) 

i.  e. ,  the  loss  corresponding  to  a  precise  decision  (Y  =  *)  equals  zero. 

3.  Function  /  (•*.  Y)  rises  monotonically  with  respect  to  both  sides  from 

point  x  =  (i.  e. ,  from  point  1  =  0). 

4.  In  the  highly-accurate  systems  we  are  examining,  when  possible  errors 

(V  —  *)  must  be  very  slight,  it  is  most  natural  to  consider  the  weights  of  positive 
and  negative  errors  to  be  identical.  Therefore,  in  such  systems,  the  loss  function 
selected  usually  is  symmetrical  and  nondecreasing.  In  addition,  the  assumption 
most  of  the  time  is  that  the  loss  is  a  function  only  of  the  error  modulus 
(Y  —  •*).  i.  e., 


/ (X,  V)  - / (x-y)  - /  ( y-x ),  (19.15) 

where  I  (y  —  x)  —  nondecreasing  function  of  error  (y  —  x)  . 

F  unctions  of  the  type  (?  —  •*)*  and  |?  —  -*1  ,  corresponding  to  well-known  criteria 
of  minimum  root-mean-square  error  and  minimum  error  modulus,  are  examples  of  loss 
functions  satisfying  conditions  (19.13) — (19.15). 

We  now  will  demonstrate  that,  given  a  single-humped  distribution  Py(x),  the 
maximum  inverse  probability  density  criterion  is  optimum  for  all  loss  function 
I  (•*.  Y)  types  satisfying  conditions  (19.13) — (19.15). 

A  symmetrical  single-humped  distribution  P^(x)  and  two  postulations  of 
(19.15)-type  loss  function  I  i*,y)  are  depicted  in  Figure  19.4a.  The  function 
/  (x,  y)  postulation  in  a  receiver  operating  on  the  maximum  inverse  probability 
principle  is  depicted  by  a  solid  line.  Here,  ?***»,  .  The  function  /  (.r,  y) 
postulation  in  a  receiver  operating  on  some  other  principle  is  depicted  by  the 
dotted  line. 


*  s 


PyfXlllX.n, 


Figure  19.4.  (d)  —  Section. 


In  this  case,  Y  —  ■***  + m  (y),  where  magnitude  m(y)  characterizes  the  deviation 
from  principle  Y  *“  *»■  . 

Values  of  a  derivative  of  functions  Py(x)  and  /(*,?)  are  presented  /370 
in  Figure  19.4b.  The  areas  formed  by  curves  1  and  2  equal  the  values  of  conditional 
risk  Ry  for  functions  /  (x,  xn)  and  /  (*,  xn  +  m)  ,  respectively  [see  formula  (17.31)] 

We  will  show  that  a  larger  area  corresponds  to  curve  2  than  to  curve  1. 

For  proof,  we  will  compare  the  areas  individually  for  the  following  three  sections: 

1.  Areas  located  where  x^x9K  . 

2.  Areas  located  where  *■•  *,*-*-?. 

3.  Areas  located  where  . 

In  the  first  section,  the  ares  corresponding  to  curve  2  always  is  greater 
some  positive  magnitude  since,  in  this  section,  curve  2  (Figure  19.4a) 
located  4>ove  curve  1. 

In  the  third  section,  the  arse  corresponding  to  curve  2  always  is  less  by 
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some  positive  value  AQ^  since,  here,  curve  2  (Figure  19.4a)  is  below  curve  1 
everywhere.  But,  AQ^  >  AQ^  always  is  the  case.  Actually,  it  is  evident  from 
Figure  19.4a  and  19.4c  that,  in  the  first  and  third  sections,  the  dependence  of 
the  difference  of  ordinates  AI  of  curves  1  and  2  on  x  is  symmetrical  relative 
to  value  x  =  x  .  Therefore,  two  differences  AI.  and  AI_,  taken  equal  distances 
x  from  the  boundaries  of  the  first  and  third  sections  (Figure  19.4a  and  19.4c), 
are  equal  among  themselves,  but  function  Py(x)  at  the  point  corresponding  to  AI^ 
is  greater  than  at  the  point  where  the  ordinate  difference  equals  AIZ  (Figure 
19.4c).  This  occurs  for  any  value  (5  x  within  the  first  and  third  sections. 
Therefore,  it  is  evident  that  a  change  in  the  area  formed  by  curve  Pv  (x)  I(x,  y) causing 
a  shift  from  function  /  (*,  to  function  /  (*,  *n  +  m)will,  in  the  first  section, 
always  be  greater  (with  respect  to  absolute  magnitude)  than  in  the  third  section, 
i.  e. ,  actually  AQ^  >  AQ^. 

In  the  second  section  ix  ■*  **»  -r  y) ,  the  area  formed  by  curve  2  is  greater 
by  positive  magnitude  AQ^  than  the  area  formed  by  curve  1.  Actually,  curves  1 
and  2  (Figure  19.4a)  in  this  section  are  located  symmetrically  relative  to  the 
mean  abcissa  x  ,  but  the  curve  2  greater  ordinate  values  correspond  to  greater 
function  Py(x)  ordinate  values.  Therefore,  following  multiplication  of  the  cor¬ 
responding  ordinates,  the  area  formed  by  curve  Pi  (*)  ?  (*,  y) ,  in  the  case  /371 
of  curve  2,  ends  up  greater.  Thus,  the  overall  area  increment  equals 

AQa  +  AQZ  —  AQj  '>  0, 

and  magnitude  Ry  rises  during  the  shift  from  m  =  0  to  m  /  0. 

Consequently,  conditional  risk  Ry  is  minimal  where  m  =  0,  i.  e. ,  when  decision 
7"  selected  each  time  equals  most-probable  value  . 

Thus,  it  is  demonstrated  that,  given  the  aforementioned  assumptions  about 
function  Py(x)  and  /  ( x ,  y)  type,  a  receiver  operating  in  accordance  with  the 
maximum  inverse  probability  density  principle  (V  =*  *»«) provides  minimum  conditional 
risk  R  for  any  y  and,  consequently,  minimum  average  risk  as  well  [see  formula 
(17.30^]. 

Hence,  it  follows  that,  in  those  cases  when  distribution  Py(x)  may  be  considered 
a  single-humped  curve  symmetrical  relative  to  the  most-probable  value  for  all 


y,  a  receiver  operating  in  accordance  with  the  maximum  inverse  probability  density 
principle  is  optimum  (i.  e.,  which  insures  minimum  average  risk)  for  all  loss 
functions  satisfying  conditions  (19.13) — 19.15).  In  particular,  this  occurs  given 
a  high  signal- to-noise  ratio  for  normal  white  noise  and  not  only  for  such  noise. 

It  is  easy  to  demonstrate  (see  (125],  for  example)  that,  in  those  cases  when 
distribution  P^(x)  or  loss  function  /  (x,  y)  have  no  symmetry,  a  receiver  operating 
on  the  maximum  inverse  probability  density  principle  is  not  optimum  (i.  e.,  does 
not  provide  minimum  average  risk),  while  optimum  receiver  structure  will  depend 
on  specific  loss  function  type. 

Based  on  the  aforementioned  analysis,  it  is  possible  to  draw  the  following 
conclusions  valid  for  precisely-known  signals  or  those  with  safe  random  parameters: 

1.  Given  high  message  fidelity,  optimum  receiver  structure  and  properties 
essentially  will  not  depend  on  the  type  of  a  priori  distribution  P(x).  Therefore, 
when  seeking  the  optimum  receiver  and  evaluating  its  properties,  given  the  afore¬ 
mentioned  conditions,  it  is  possible  to  assume  that  distribution  P(x)  is  the  simplest, 
i.  e.,  uniform. 

2.  If  distribution  Py(x)  may  be  considered,  for  all  y,  a  single-humped  curve 
symmetrical  relative  to  most-probable  value  xn  (which  occurs,  in  particular, 

for  normal  white  noise  and  same  other  noise,  given  a  high  signal-to-noise  ratio), 
then  optimum  receiver  structure  will  not  depend  on  the  accepted  optimization  during 
the  search  for  a  broad  class  of  these  criteria.  All  criteria  based  on  average 
risk  minimization,  for  which  the  loss  functions  satisfy  conditions  (19.13)— (19.15), 
fall  into  this  class. 

These  conditions  are  sufficiently  general  and  the  corresponding  optimization 
class  is  sufficiently  broad— it  includes,  in  particular,  criteria  of  maximum  inverse 
probability  density,  maximum  likelihood,  minimum  mean  value  of  any  even  degree 
of  error,  minimum  mean  value  of  any  odd  degree  of  the  error  modulus  (minimum  /372 
of  (T  —  *)**  and  |y  —  xj**-i  ,  respectively),  and  others. 

3.  It  follows  from  point  2  that,  when  seeking  the  optimum  receiver  structure, 
it  is  possible  under  these  conditions  to  start  from  any  simplest  criterion  which 


falls  into  the  aforementioned  class,  the  maximum  inverse  probability  density  criterion 
or  of  the  maximum  root -mean-square  error  criterion,  for  example. 

4.  It  follows  from  the  aforementioned  points  that  all  results  of  the  analysis 
of  reception  of  individual  analog  message  values  presented  in  Parts  II  and  III 
of  this  bock  for  normal  white  noise,  stemming  from  uniform  a  priori  distribution 
P(x)  and  the  maximum  inverse  probability  density  criterion,  remain  valid  for  a 
broad  class  of  a  priori  distributions  P(x)  and  optimizations,  if  the  following 
conditions  are  met: 

a)  the  signal  is  precisely  known  or  has  safe  random  parameters; 

b)  the  signal-to-noise  ratio  is  hi^i  enouc^i  (message  fidelity  is  high  enough). 

19.4  Impact  of  A  Priori  Message  Distributions  and  Optimizations  During  Discrete 
Message  Reception 

Let  message  x  have  m  4  1  possible  values  xq,  x^,  .  .  . ,  xm,  where  xq  —  zero 
message  corresponding  to  no  signal. 

Evidently,  the  greater  the  m,  the  closer  the  discrete  message  is  to  the  character 
of  an  analog  message,  already  examined  in  the  preceding  section.  Therefore,  we 
will  restrict  ourselves  here  to  examination  of  another  extreme  case,  corresponding 
to  m  s  1,  i.  e. ,  to  binary  detection. 

It  was  shown  in  Chapter  14  that,  given  different  optimizations  (Neyman-Pearson, 
minimum  composite  error  probability,  and  others),  optimum  receiver  structure  remains 
unchanged — only  threshold  bias  at  receiver  output  changes.  The  magnitude  of  this 
bias  for  some  optimizations  will  depend  also  on  ratio  P(xq)/P(x^)  of  the  a  priori 
signal  absence  and  presence  probabilities. 

Consequently,  for  a  broad  class  of  optimizations  examined,  optimum  receiver 
structure,  except  for  output  threshold,  will  not  depend  on  the  optimization  selected 
and  a  priori  message  distribution.  Therefore,  all  that  remains  is  to  explain 
the  extent  to  which  receiver  output  threshold  will  depend  in  the  general  case 
on  selected  optimization  and  on  a  priori  message  distribution. 

A  study  of  this  problem  conducted  in  [1251  made  the  following  results  possible: 


1.  During  reception  of  discrete  messages,  as  was  the  case  for  analog  messages, 
the  impact  of  a  priori  message  distributions  and  optimizations  (error  weights)  /373 
on  optimum  receiver  structure  and  properties  monotonically  decreases  with  an  increase 
in  signal-to-noise  ratio,  striving  asymptotically  towards  zero. 

2.  The  point  1  postulation  is  valid  at  least  for  noise  in  the  form  of  additive 
normal  white  noise  and  a  precisely-known  signal. 

3.  If  the  sigial  has  parasitic  random  parameters,  then  various  cases  are 
possible,  depending  on  the  nature  of  these  parameters. 

The  point  1  postulation  remains  completely  valid  for  some  types  of  parasitic 
random  parameters.  Such  random  parameters  may  be  called  safe  in  an  analogy  with 
the  case  of  analog  messages.  This  includes,  in  particular,  the  random  initial 
phase  of  signal  rf  occupation. 

The  point  1  postulation  may  be  satisfied,  but  not  fully,  for  other  types 
of  parasitic  random  parameters.  This  includes,  in  particular,  the  amplitude  of 
a  fluctuating  signal  distributed  in  accordance  with  Rayleigh's  law. 

4.  Error  weights  impact  upon  optimum  receiver  structure  and  properties  just 
like  a  priori  message  probabilities  do. 

19.3  Conclusion 

The  analysis  presented  in  this  chapter  makes  it  possible  to  draw  the  following 
general  conclusions  valid  for  reception  of  discrete  messages  and  individual  analog 
message  values.* 

1.  Given  an  increase  in  signal-to-noise  ratio  (increase  in  message  reception 
accuracy  or  reliability),  the  impact  of  the  a  priori  distribution  P(x)  type  on 


♦These  conclusions  may  not  be  extended  completely  to  reception  of  oscillations 
(filtration).  This  is  because,  in  the  case  of  filtration,  distributions  P(x) 
and  Py(x)  are  not  uniform,  but  are  n- dimensional;  here,  magnitude  *=2/,  r  in  many 

cases  in  very  great  and  even  will  strive  towards  infinity. 


optimum  receiver  structure  and  properties  asymptotically  will  strive  towards  zero 
if  the  signal  is  precisely  known  or  has  safe  random  parameters. 

2.  In  particular,  initial  phase  d^Q  of  rf  occupation  (constant  during  time 
of  observation  T)  falls  in  the  category  of  safe  random  parameters. 

3.  It  follows  from  point  1  that,  when  designing  the  optimum  receiver  and 
analyzing  its  properties,  it  is  possible  when  the  signal-to-noise  ratio  is  high 
to  start  in  the  first  approximation  from  uniform  a  priori  distribution  P(x)  if 
the  signal  is  precisely  known  or  has  safe  random  parameters  (random  initial  phase, 
in  particular). 

4.  Given  a  high  signal-to-noise  ratio  and  noise  in  the  form  of  normal  white 
noise  (and  not  only  given  such  noise),  a  receiver  operating  in  accordance  with 

the  maximum  inverse  probability  Py(x)  principle  (maximum  inverse  probability  density 
in  the  case  of  analog  messages)  is  optimum  for  a  broad  class  of  optimizations 
if  the  signal  is  precisely  known  or  has  safe  random  parameters.  /374 

In  the  case  of  analog  messages,  all  criteria  based  on  average  risk  minimization 
and  loss  function  I  (x,  y)  f  being  a  nondecreasing  function  of  error  modulus  hr  —  *|  , 
will  fall  into  this  class.  In  particular,  the  minimum  root-mean-square  error 
criterion  will  fall  in  this  class. 


5.  It  follows  from  point  4  that,  when  designing  the  optimum  receiver  and 
analyzing  its  properties,  given  a  high  signal-to-noise  ratio,  it  is  possible  to 
begin  in  the  first  approximation  from  any  simplest  optimization  belonging  to  the 
aforementioned  class  if  noise  has  the  form  of  normal  white  noise  and  the  signal 
is  precisely  known  or  has  safe  random  parameters.  In  particular,  it  is  possible 
to  begin  from  the  maximum  inverse  probability  density  P^(x)  criterion  or  the  minimum 
root-mean-  square  error  criterion. 

6.  Postulations  1—5  are  more  accurate,  the  higher  the  signal-to-noise  ratio 
(the  higher  the  requirement  for  message  reception  fidelity  or  reliability). 

More-detailed  analysis  of  each  specific  case  is  required  for  determination 
of  the  error  magnitude  in  the  general  postulations  presented  for  a  given  signal- 
|  to-noise  ratio  (given  specific  message  reception  reliability). 

537 

* 


CHAPTER  TWENTY 


/374 


CERTAIN  LIMITATIONS  INHERENT  IN  STATISTICAL  DECISION  THEORY  AND  WAYS  TO  SURMOUNT 
THEM 


20.1  General  Comments 

Statistical  decision  theory  presented  has  a  rather  general  nature  and  will 
find  wide  use  for  solution  of  practical  problems.  However,  as  is  the  case  with 
any  theory,  it  is  based  on  a  number  of  assumptions  and  limitations,  which,  in 
some  cases,  may  not  be  satisfied.  Therefore,  to  avoid  errors,  it  is  necessary 
to  present  these  assumptions  and  limitations  clesrly.  The  following  are  the  most 
important  ones: 

1.  The  assumption  is  that  a  priori  distributions  of  random  actions  (messages 
parasitic  signal  parameters,  noise)  are  precisely  known. 

2.  The  assumption  is  that  loss  function  ^  (*.  Y)  is  known. 

3.  It  ia  assumed  that  the  characteristics  of  all  actions  will  not  depend 
on  which  system  is  designed. 

4.  No  limitations  are  placed  on  the  system  designed,  other  than  the  general 
requirement  for  physical  realization  (which  usually  means  that  the  reaction  /37 5 


at  system  output  must  not  arise  prior  to  onset  of  its  causal  input  action)  and, 
if  necessary,  requirements  that  the  system  action  algorithm  be  determinate  (non- 
randomized) . 


5.  System  quality  is  evaluated  only  by  message  fidelity,  i.  e.,  other  important 
system  quality  indicators  (cost,  complexity,  weight,  overall  dimensions,  reliability, 
and  others)  are  not  considered. 

6.  One  number — average  risk,  characterizes  message  fidelity. 
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Several  ways  to  ameliorate  certain  limitations  already  have  been  noted,  but 
the  overall  problem,  in  light  of  its  importance,  requires  more-detailed  examination. 
The  next  chapter  is  devoted  to  the  limitation  noted  in  point  1,  while  the  remaining 
limitations  are  examined  in  this  chapter. 

20.2  Limitations  on  Selection  of  Loss  Function  I(^,  If )  Type 

As  noted  in  the  preceding  chapter,  in  a  case  of  high  message  fidelity  (validity), 
the  structure  of  the  system  designed  often  is  not  critical  where  loss  function 
type  is  concerned  within  the  broad  class  of  such  functions  used  in  practice. 

However,  there  may  be  cases  in  which  it  is  impossible  a  priori  to  be  sure  that 
the  selected  loss  function  corresponds  well  enouc^i  to  the  physical  crux  of  the 
problem  and  that  a  change  in  its  type  will  not  lead  to  a  significant  change  in 
the  structure  and  properties  of  the  system  designed.  What  does  one  do  in  such 
cases? 
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It  is  impossible  to  provide  universal  and  comprehensive  recommendations  here. 
Therefore,  we  only  will  note  several  pos8ible  variants  for  surmounting  this  dif¬ 
ficulty. 

First  variant.  Undertake  system  design  for  not  one,  but  several,  significantly- 
different  rational  loss  function  types  and  compere  the  results  obtained.  If  results 
turn  out  to  be  relatively  close  (i.  e.,  a  system  which  provides  minimum  average 
risk  R  for  one  kiss  function  provides  value  R  close  to  minimum  for  other  loss 
functions  as  well),  then  one  may  consider  the  system  design  completed.  Otherwise, 


other  system  design  methods  not  requiring  provision  of  a  specific  loss  function 
type  must  be  used.  Several  of  these  are  described  below. 

Second  variant.  Use  the  maximum  inverse  probability  criterion  (for  discrete 
messages)  or  the  maximum  inverse  probability  density  criterion  (for  analog  messages) 

However,  it  was  demonstrated  in  §  17.2  that  the  maximum  inverse  probability 
density  criterion  coincides  with  the  minimum  average  risk  criterion,  given  a  simple 
loss  function  type  ((17.36a).  Therefore,  the  second  approach  examined  is  equivalent 
to  selection  of  a  (I7.36a)-type  loss  function. 

Third  variant.  If  not  only  the  loss  function  /  (*.  V)  type,  but  a  priori 
message  distribution  P(x)  also,  are  unknown,  the  maximum  likelihood  criterion  /376 
may  be  used  [or,  which  is  equivalent,  the  second  variant  may  be  used,  having  assumed 
here  that  distribution  P(x)  is  uniform]. 

Fourth  variant.  During  binary  reception  (i.  e. ,  during  binary  detection 
or  recognition  of  two  non-zero  signals),  when  the  reproduced  message  may  have 
only  two  values,  x^  and  x2,  it  is  convenient  to  characterize  reception  quality 
by  two  quality  indicators — conditional  error  probabilities 

*1  «*  Pxt  (•*,)  and  kt  =  P„t  (x,) 

(evidently,  in  a  case  of  binary  detection,  x^  and  x2  should  be  replaced  by  xq 
and  Xj,  respectively  and  this  should  be  assumed:  =  Px,(xt)  =  Pni,  kt  =  P*,(jr0)  =  Pn p- 

Here,  it  is  possible  to  3eek  the  optimum  system  in  accordance  with  the  criterion 

k2  =  min,  where  k^  =  const. 

As  will  be  demonstrated  in  Chapter  22,  such  a  result  will  be  obtained  if 
you  use  the  criterion 

k^  s  min,  where  k2  =  const. 

Evidently,  in  a  case  of  binary  detection,  criterion  k2  s  min,  where  k^  =  const 
is  nothing  other  than  a  Neyman-Pearson  criterion. 


20.3  Assumption  3  Impact 


It  is  assumed  in  optimum  receiver  system  design  that  the  distribution  of 
probabilities  P(x,  y)  s  P(x)Px(y)  will  not  depend  on  the  type  of  needed  decision 
rule  P(y).  This  is  equivalent  to  the  supposition  that  message,  parasitic  signal 
parameter,  and  noise  characteristics  will  not  depend  on  designed  receiver  structure. 

In  actuality,  this  supposition  may  tum  out  to  be  invalid,  at  least  for  the  following 
reasons. 

First,  if  the  actual  system  is  built  in  accordance  with  optimum  decision 
rule  FRp  (y)  found,  then  this  system  must  to  some  degree  differ  from  the  ideal  system, 
due  to  presence  of  internal  instabilities  and  noise,  for  example.  Considering 
internal  instabilities  and  noise,  a  system  realizing  some  ncn-ideal  rule  P (y) 
may  tum  out  to  be  better  than  a  system  realizing  rule  r,,p(y),  i.  e. ,  the  system 
design  result  may  be  incorrect. 

The  aforementioned  idealization,  in  particular,  led  to  the  fact  that  amplifiers 
will  not  be  included  in  the  structure  of  all  optimum  receivers  described  in  preceding 
chapters  (Figures  4.3,  5.2,  5.3,  and  so  on).  There  may  be  no  amplification  of 
mixture  y(t).  This  postulation  would  be  valid  if  the  mixture  y(t)  processor  [i.  e., 
device  realizing  optimum  algorithm  r kP <Sf) 3  would  not  include  any  additional  noise. 
However,  any  actual  processor  will  contain  internal  noise.  Therefore,  in  order  /377 
for  resultant  algorithm  (y)  actually  to  be  optimum,  in  the  general  case  noise 
action  of  the  processor  realizing  this  algorithm  must  be  infinitesimally  slight, 
i.  e.,  processor  output  signal  level  must  be  high  compared  to  processor  internal 
noise  level.  This  signifies  that  an  amplifier  providing  requisite  amplification 
must  in  the  general  case  be  connected  between  receiver  antenna  and  processor. 

As  was  demonstrated  in  preceding  chapters,  a  linear  filter  usually  is  the 
optimum  receiver  input  section.  Here,  preliminary  amplification  and  linear  filtration 
may  be  combined  in  the  same  receiver  stages}  the  internal  noise  of  these  stages 
is  considered  by  supplying  (converting)  it  to  amplifier  input  and  including  it 
within  additive  noise  (0  within  input  signal-plus-noise  y(t). 


A  second  reason  disrupting  the  validity  of  assumption  3  may  be  the  presence 
of  feedback  from  receiver  output  to  its  input.  Such  feedback  may  occur,  for  instance, 


if  the  designed  receiver  is  included  in  a  closed  radio  control  system  (in  a  missile 
target  seeker  system,  for  example).  Presence  of  this  feedback  will  lead  to  the 
fact  that  distribution  P(x,  y)  =  P(x)Px(y)  turns  out  to  be  dependent  on  receiver 
output  voltage  V  (t)  and,  consequently,  also  on  needed  decision  rule  Y~  f  (</)• 
Therefore,  optimum  decision  rule  ^kP  (y)  found  without  considering  feedback  may 
already  not  be  optimum  when  this  feedback  is  considered. 

Strict  mathematical  system  design  of  optimum  receivers  included  in  a  closed 
control  system  turns  out  to  be  significantly  more  complex  (see  [133],  for  example]. 
However,  in  a  number  of  cases,  results  obtained  throu^i  closed  system  design  turns 
out  in  the  first  approximation  to  be  valid  also  when  the  receiver  operates  as 
part  of  a  closed  control  system  (see  [144],  for  example). 

A  third  reason  for  disruption  of  assumption  3  validity  arises  during  receiver 
system  design  given  organized  (intentional)  noise  action. 

An  enemy  will  strive  to  create  interference  that  would  act  the  strongest 
against  a  system  we  have  designed.  Therefore,  if  we  succeed  in  finding  some  algorithm 
r„p  (y)  ,  which  minimizes  interference  of  a  given  type,  then  the  enemy  will  attempt 
to  change  the  nature  of  the  interference  in  order  to  intensify  its  action.  Therefore, 
in  this  case,  called  a  conflict  (adversary)  situation,  optimum  receiver  algorithm 
Ja»  (if)  will  depend  input  signal-plus- noise  y(t)  characteristics,  while  these 
characteristics  in  turn  are  dependent  on  the  receiver  action  algorithm. 

Receiver  system  design  in  conflict  situations  is  complicated  greatly.  The 
most-typical  system  design  approaches  in  such  situations  are: 

1.  Use  of  the  approximation  by  iteration  method,  which  consists  of  the  following 
stages. 

In  the  first  stage,  noise  characteristics  are  supplied  based  on  a  priori  /378 
data  and  "common  sense"  and  the  optimum  receiver  structure  corresponding  to  these 
characteristics  is  found,  i.  e. ,  algorithm  1* „p  (y)t  which  may  be  considered  the 
zero  approximation. 

In  the  second  stage,  the  noise  (within  the  boundaries  of  given  rational 


limitations)  which  will  exert  the  strongest  interference  for  resultant  structure 
r#  Kp  (y)  and  the  following  (first)  approximation  r„,p  (y)  for  optimum  receiver  structure 
is  found  considering  this  noise.  Next,  where  required,  second  approximation 
r»p  (y)  and  so  on  may  be  found  analogously.  However,  one  should  note  that,  in  a 
number  of  cases,  approximation  by  interation  convergence  turns  out  to  be  too  slow 
or  completely  is  lacking. 

2.  Refusal  to  perform  mathematical  system  design,  performance  of  a  comparative 
analysis  of  the  noise  immunity  of  different  specific  receiver  construction  variants 
(given  several  organized  noise  characteristic  variants),  and  selection  of  the 
most-suitable  variant. 

3.  Use  of  game  theory.  Possible  problem  formulations  when  game  theory  is 
used  are  examined  in  the  following  section. 

20.4  Problem  Formulation  When  Game  Theory  is  Used 

Game  theory  is  most  advanced  for  so-called  zero-sum  pair  games.  Here,  the 
receiver  system  design  problem  being  examined  is  formulated  in  the  following  manner. 
Only  two  players  participate  in  the  game  (pair  game)— receiver  designer  (#i)  and 
organizer  of  the  interference  disrupting  the  operation  of  the  receiver  (^t) . 

The  interests  of  the  players  are  directly  contradictory:  a  win  for  one  of  them 
is  a  loss  for  the  other  (zero-sum  game).  Each  player  will  strive  to  play  in  such 
a  way  as  to  insure  the  best  result  for  himself  in  the  supposition  that  the  opponent 
will  strive  to  make  this  best  result  the  worst.  Consequently,  game  theory  provides 
an  opportunity  to  find  the  best  algorithm  for  the  worst  potentially-possible 
case. 

Games  are  divided  into  discrete  (matrix)  and  analog  (differential).  In  turn, 
discrete  games  are  divided  into  single-step  and  multistep.  In  a  single-step 
(single-move)  game,  the  opponents  make  a  total  of  one  move  each  and  the  game  ends. 

As  opposed  to  this,  a  multistep  (multimove)  game  comprises  a  sequence  of  moves 
made  by  each  opponent.  The  theory  of  multistep  games  is  considerably  more  complex 
and  closely  approximates  the  theory  of  dynamic  programming. 

In  the  case  of  a  discrete  single-step  game,  the  task  of  the  first  player 


boils  down  to  selection  of  some  strategy  (move)  ?v  from  a  finite  number  M  of 
possible  moves,  while  the  task  of  the  second  player  is  to  select  some  strategy 
(move)  Pn  of  N  possible  moves  (  v  s  1,  2,  .  .  .,  N;  =  1,  2,  .  .  N).  The 

results  of  each  possible  combination  of  moves  (v>  (0  are  evaluated  by  some  /379 
number  u.  equating  to  a  win  for  the  first  player,  and  thereby  a  loss  for  the 
second.  The  following  win  (loss)  matrix  corresponds  to  the  aggregates  of  all 
possible  combinations  of  opposing  strategies 

y4  =  |  Qv.n  j  i 


called  a  payoff  matrix. 

For  example,  this  situation  may  be  reduced  to  a  discrete  single-step  game. 

M  possible  receiver  action  algorithms  Yi  F,  (y) Y*  “  I*  (y) .  Y m  =  Tm  (y)  exist. 

It  is  known  that  the  enemy  may  use  any  of  N  possible  organized  interferences 
Pi.  •••.  Pit.  Pat  against  the  receiver.  Since  all  possible  algorithms  Tv  and  all 

possible  interferences  P*  are  assumed  to  be  known  beforehand,  it  is  possible 
through  analyis  to  estimate  result  a*. »  of  the  action  of  interference  P„  on  receiver 
with  algorithm  rv  .  The  corresponding  receiver  quality  index  may  be  selected 
here  as  aY.  „  . 

For  example,  if  the  receiver  is  designed  for  signal  detection,  composite 
sigial  detection  error  probability  Pom  (or  1  —  Pom )  may  be  such  an  index. 

The  problem  is  to  select  from  all  possible  strategies  (algorithms)  r„  ....  rM  the 
one  which  provides  the  best  (the  least  if  a  =  P am  or  the  greatest  if  a  =  1  — 

P on  )  value  of  quality  index  Oy. »  ,  given  that  the  enemy  uses  the  most  dangerous 
(in  the  sense  of  the  same  qiality  index)  of  interferences  Pi.  ....  Pjv  . 

For  specificity,  we  will  assume  that  quality  index  a*.  „  is  such  that  the 
less  it  is,  the  better  the  receiver  (i.  e.,  the  greater  the  win  for  the  first 
player).  For  instance, 

Oy.li  =  » 


where  Ry. »  —  average  risk  determined  given  that  type  Pn  interference  acts  on 


v  v  v'  •_  v  *.  v  *.  *- 


the  receiver  and  algorithm  r»  j.s  selected.  Then,  the  problem  may  be  formulated 
mathematically  in  the  following  manner. 


Average  risk  Ry.  »  magnitude  will  depend  on  selected  receiver  algorithm 
and  interference  type  Pit  ,  i.  e. , 


Rv.li  —  R(r*i  P(,)> 


(20.1) 


Since  fully-determinate  v  and  p  correspond  unambiguously  to  numbers  T, 
and  Pit  ,  instead  of  (20.1),  it  also  is  possible  to  write 


Ry.it  =  /?(v,  p). 


(20.1a) 


The  first  player  will  strive  with  ry  selection  to  minimize  magnitude  Ry. » 
for  such  noise  p4  at  which  this  minimal  magnitude  is  maximum.  For  every  strategy 
Ty  ,  the  worst  (most  dangerous)  strategy  P»  will  be  that  for  which 


W(ry,  p»»)  =  max /?  (Ty,  P(i). 

Pit 

Therefore,  the  first  player  must  select  strategy  I*y  so  that  magnitude  /3B0 

max  R( rv,  p„)  wiH  be  minimal,  i.  e. ,  from  the  condition  that 
•h* 


max  R  (r v,  P„)  =  min  [max  R  (ry,  p^)I  =  Rt. 

P(l  rV  Pit 


(20.2) 


The  second  player,  reasoning  in  an  analogous  manner,  must  select  strategy 
Pu  from  the  condition 


min /?(!%,  P„)  =  maxrmin/?(ry, 


Pu)J  -  Rt- 


(20.3) 


In  other  words,  the  first  player  will  start  from  the  minimax  criterion  (minimizes 
maximum  magnitude  of  risk  R),  while  the  second  starts  from  the  opposite,  the  maximin 
criterion  (maximizes  the  minimum  mayiitude  of  risk  R). 


Since  dependence  R  (rv.  Pu)  is  assumed  known,  then  it  is  possible  from  relationship 
(20.2)  to  find  strategy  Pu  providing  the  risk  R  maximum  and  strategy  1%'  ,  which 
minimizes  this  maximum.  Consequently,  performing  operations  stipulated  by  rela¬ 
tionship  (20.2)  makes  it  possible  to  find  opposing  strategy  values  P4'  and  Tv' . 
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Analogous  to  this,  opposing  strategy  values  Pm'  and  IV' are  obtained  from  rela¬ 
tionship  (20.3).  Here,  average  risk  value  equalling  corresponds  to  strategy 
combination  IV),  while  value  Rg  corresponds  to  combination  (fV.  Tv*) .  n 
is  shown  in  game  theory  that  always 

Rx>K  (20.4) 

If  it  turns  out  that 

(20.5) 

then,  here 

P 4  =  Pi  and  ri  =  r;,  (20.6) 

and,  consequently,  game  theory  makes  it  possible  unequivocally  to  determine  both 
the  optimal  strategies  of  both  players,  and  the  corresponding  game  outcome  (game 
value) .  i.  e.,  magnitude  R. 

However,  it  may  happen  when  solving  some  problems  that 

/?!>/?,.  <2D«7) 

In  this  case,  the  aforementioned  approach  does  not  make  it  possible  to  find  either 
optimal  strategies  or  game  value  and,  obtaining  a  decision  requires  modification 
in  the  problem  formulation,  introducing  in  place  of  "pure"  (determinate  nonrandomized) 
strategies  rv,  p„  so-called  "mixed"  (stochastic  randomized)  strategies.  Here, 
players  select  strategy  rv  and  Pn  randomly  (through  use  of  the  corresponding 
independent  statistical  mechanisms)  in  accorcbnce  with  probability  distributions 


to  —  *  —  *7*.  •»  «.  -« 


where 


/381 

M  jv 

2pv  =  i.  S?4  =  i. 

V—  I  It—  I 


When  such  mixed  strategies  are  used,  magnitude  R  (v,  I')  turns  out  to  be  random 
and  the  game  outcome  is  estimated  by  expected  value 

_  *  *  N 
W*=R(pt  %R*.nPvQif 

v-m-i  (20.9) 

Here,  player  Ht  will  strive  to  select  distribution  q  so  that  magnitude  W(p,  q) 
will  be  maximum,  while  player  Ht  selects  distribution  p  so  that  this  maximum 
magnitude  will  be  the  least.  Therefore,  in  a  case  where  mixed  strategies  are 
used,  the  following  relationships  are  the  analog  of  expressions  (20.2)  and  (20.3), 
respectively : 


max7?(p,  <7)  =  min|’max/?(pf  g)J  =  K,\ 

(20.2a) 

min7?(p,  q)  =  max  j’min  (p,  <?)j  =  Rt'- 

(20.3a) 

Here,  the  result  always  is 


(20.10) 


where 


(20.11) 


It  follows  from  (20.10)  that,  when  mixed  strategies  are  used,  a  game  (of 
the  examined  class)  always  has  an  outcome  (i.  e.,  fully-determinate  optimum  dis¬ 
tribution  p  and  q  values  and  corresponding  game  outcome  value  R*  exist),  which 
may  be  found  from  relationship  (20.2a)  or  (20.3a). 


It  follows  from  (20.11)  that,  when  mixed  (rather  than  pure)  strategies  are 
used,  each  of  the  players  may  not  worsen  his  outcome  and,  where  R_  >  R.,  he  may 


even  improve  it.  Therefore,  if  use  of  pure  strategies  will  lead  to  the  inequality 
of  magnitudes  R^  and  R2,  the  switch  from  pure  to  mixed  strategies  is  completely 
logical. 

Insurmountable  difficulties  usually  are  not  encountered  when  finding  outcomes 
of  discrete. games  of  the  class  examined.  An  outcome  with  the  direct  method  is 
possible,  given  a  small  number  of  possible  strategies  (small  M  and  N),  while  linear 
programming  methods  are  used  when  these  numbers  are  large.  In  complex  cases, 
recommendations  made  in  [132]  are  based  on  use  of  the  corresponding  pattern  algo¬ 
rithms. 

Analog  (differential)  games  also  may  be  based  on  use  either  of  pure  or  of 
mixed  strategies.  But,  in  the  case  of  analog  games,  the  possible  algorithm  types 
Y  —  T  Uj) are  not  limited  to  a  finite  number  of  discrete  values,  but  may  vary  con¬ 
tinuously  (within  the  range  of  given  limitations).  This  circumstance  /382 

increases  mathematical  difficulties  in  problem  solution  to  such  an  extent  that, 
at  present,  methods  for  solving  analog  games  have  been  developed  only  for  certain 
particular  cases. 

At  present,  a  great  deal  of  literature  has  been  devoted  to  game  theory  [140, 

141,  142,  143,  and  others].  However,  its  use  for  radio  receiving  system  design 
still  is  very  limited  (see  [171],  for  example).  This  mainly  is  explained  by  the 
following. 

First,  the  mathematical  "game"  model  described  above  only  very  roughly  describes 
an  actual  adversary  situation  arising  when  providing  electronic  countermeasures. 

Second,  this  mathematical  model  makes  it  possible  to  find  the  optimum  receiver 
structure  only  in  the  supposition  of  the  worst  possible  interference.  If  this 
interference  in  actuality  does  not  tum  out  to  be  the  worst,  then  the  resultant 
receiver  structure  will  not  tum  out  to  be  optimum. 

Third,  solution  of  the  problem  in  the  case  of  an  analog  game  may  encounter 
serious  mathematical  difficulties. 

However,  given  a  conflict  (adversary)  situation,  game  theory,  in  spite  of 


its  limitations  and  shortcomings,  at  present  is  the  mathematical  vehicle  corresponding 
most  closely  to  the  nature  of  this  situation. 

20.5  Certain  Additional  Limitations  Inherent  in  Statistical  Decision  Theory 

We  now  will  dwell  on  assumptions  and  limitations  4,  5,  and  6  noted  on  pages 
558  and  539. 

Several  limitations  are  placed  on  a  receiver  during  its  actual  development 
(design  process),  the  main  ones  usually  being  cost,  weight,  and  overall  dimensions. 
During  receiver  system  design  based  on  the  aforementioned  statistical  decision 
theory,  these  very-important  restrictions  and  quality  indicators  are  not  considered 
quantitatively  at  all.  Only  a  certain  hidden  (qualitative)  consideration  of  these 
limitations  is  possible.  It  may  comprise,  for  example,  the  following: 

1.  Optimum  algorithm  r«p  (y)  is  soucfit,  not  in  the  class  of  all  physically 
realizable,  but  only  in  the  class  of  determinate  (nonrandomized)  algorithms. 

This  signifies  that  statistical  (randomized)  algorithms,  usually  (but  not  always) 
being  more  complex  (and,  consequently,  having  greater  cost,  weight,  and  overall 
dimensions)  are  not  considered. 

2.  It  is  possible  to  seek  optimum  algorithm  r„p  (y)  only  in  the  class  of  linear 
systems  (as  is  done  in  §  2.2,  for  example).  Since  linear  algorithms  usually  (but 
not  always)  are  simpler  for  realization,  then  this  will  lead  to  a  decrease  in 
cost,  weight,  and  overall  dimensions. 

3.  It  is  possible  to  seek  the  optimum  algorithm  in  the  single-channel,  rather 
than  multichannel,  system  class. 

However,  first,  not  in  all  cases  is  a  nonrandomized  algorithm  simpler  than 
a  randomized  one,  a  linear  system  simpler  than  a  nonlinear  one,  or  a  single-  /383 
channel  system  simpler  than  a  multichannel  one.  Second,  there  are  other  ways 
to  simplify  the  system.  Finally,  there  usually  is  s  requirement  to  estimate  the 
possible  decrease  in  cost,  weight,  and  overall  dimensions  both  qualitatively  and 
quantitatively. 

For  these  reasons,  optimum  algorithm  TM(y)  found  from  statistical  decision 
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theory  usually  may  not  be  considered  final  and  requires  further  estimation  and 
adjustment  considering  other  quality  indicators,  primarily  reliability,  cost, 
weight,  and  overall  dimensions. 

A  statistical  decision  theory  (in  its  contemporary  form)  limitation  is  not 
only  that  it  does  not  consider  any  receiver  qualities  other  than  message  fidelity 
at  all,  but  also  that  the  latter  is  considered  only  by  a  single  number— average 
risk  R. 

By  definition,  average  risk  equals  the  expected  value  of  loss  /  (*,  y)  arising 
when  decision  y  is  taken  about  message  x.  But,  the  expected  value  is  far  from 
the  complete  characteristic  of  random  magiitude  (or  random  process)  I  (*,  y)  . 

In  particular,  besides  expected  value  R  s  T  of  the  loss  magnitude,  also  of  significant 
value  may  be  this  magnitude's  variance 

of-JJ-Tp. 

Therefore,  a  population  of  two  quality  indices— average  risk  R  and  risk  variance 
Oi*  ,  is  a  more-complete  (but,  in  tum,  not  absolutely  complete)  message  fidelity 
characteristic.  However,  optimum  receiver  system  design  based  on  a  population 
of  two  or  more  indices  is  in  the  general  case  considerably  more  complex.  Several 
special  features  of  this  system  design  are  examined  in  Chapter  22. 

To  conclude  this  chapter,  we  will  note  the  following.  Statistical  decision 
theory  makes  it  possible  to  find  structure  r„p  (y)  of  a  receiver  providing  minimum 
average  risk  value  R  =  Rma  .  When  seeking  this  structure,  no  consideration 
is  given  to  a  series  of  very-important  factors  [a  priori  data  ambiguity,  nonideality 
of  the  algorithm  r*P  (y)  realization,  presence  of  an  entire  series  of  additional 
quality  indices  such  as  cost,  weight,  overall  dimensions,  reliability,  loss  variance 
9/a  ,  and  others].  However,  consideration  of  any  of  these  factors  (or  the  entire 
aggregate  of  factors)  may  not  lead  to  a  decrease  in  average  risk  R  compared  to 
£**■  »  "tty  cause  only  an  increase  in  average  risk.  Therefore,  one  may  assert 
that  statistical  decision  theory  makes  it  possible  to  find  the  potential  (best 
theoretical ly-possible)  value  of  the  average  risk  magnitude  which  may  not  be  exceeded 
(improved)  in  any  real  system,  considering  all  its  inherent  limitations  and  quality 
indices. 


Knowledge  of  this  potential  main  receiver  quality  index  value  is  very  important 
during  design  of  actual  radio  receiving  devices.  Therefore,  statistical  decision 
theory,  in  spite  of  all  its  serious  inherent  limitations,  has  found  very  wide 
use  in  radio  receiving  device  system  design. 


CHAPTER  TWENTY-ONE 
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BRIEF  DESCRIPTION  OF  RECEIVER  SYSTEM  DESIGN  METHODS  GIVEN  INCOMPLETE  A  PRIORI 
INFORMATION 


21.1  General  Description  of  Possible  System  Design  Problem  Formulation  Variants 
and  Ways  to  Solve  Them 

Use  of  "classic"  statistical  decision  theory  presented  in  Chapter  17  requires 
knowledge  of  the  a  priori  distribution  of  message  P(x)  and  likelihood  function 
Px(y).  The  latter,  in  turn,  is  determined  from  the  distributions  of  parasitic 
signal  parameters  (non-additive  noise)  and  additive  noise  [see  relationships 
(17.2)— (17.6)].  If  all  these  distributions  are  known,  then  a  so-called  case 
of  complete  a  priori  information  occurs  and  the  optimum  receiver  structure  may 
be  found  by  average  risk  R  minimization,  i.  e.,  from  condition  (17.29). 

Essentially,  however,  all  or  some  of  these  distributions  usually  are  not 
known  precisely*,  i.  e.,  a  so-called  case  of  incomplete  a  priori  information  occurs. 
In  other  words,  prssence  of  complete  a  priori  information  may  be  considered  as 
a  threshold  (extreme)  particular  case  of  a  real  situation  in  which  a  priori  infor¬ 
mation  is  always  incomplete  to  some  degree. 

*In  addition,  in  several  cases,  a  priori  distributions  may  in  the  process  of 
receiving  device  operation  cggnge  in  accordance  with  unknown  a  priori  laws. 


Since  phenomena  in  nature  vary  infinitely,  the  degree  of  a  priori  information 
incompletene38  also  may  vary  within  unlimited  bounds.  Therefore,  it  is  impossible 
in  principle  to  create  a  single  theory  which  would  encompass  all  possible  cases 
of  a  priori  information  incompleteness  and  thus  be  efficient  enoucfi  in  all  cases. 

One  may  only  indicate  individual  classes  of  phenomena  and  some  theoretical  approaches 
corresponding  in  some  degree  or  other  to  each  of  these  classes  or  some  group  of 
classes.  Therefore,  at  present,  the  theory  of  optimum  reception  methods,  given 
incomplete  a  priori  information,  still  is  in  the  initial  development  stage,  in 
spite  of  the  fact  that,  in  recent  years,  many  works  have  been  devoted  to  it  [131, 

132,  137,  150,  151,  160-170,  and  others]. 

This  chapter  has  as  its  goal  only  to  familiarize  the  reader  with  the  ideas 
and  methods  which  make  it  possible,  to  a  certain  degree,  to  surmount  this  so-called 
a  priori  difficulty.  Several  ways  of  overcoming  difficulties  arising  when  a  priori 
distribution  P(x)  of  messages  is  unknown  already  have  been  pointed  out  in  preceding 
chapters. 

Thus,  it  was  demonstrated  in  Chapter  19  that,  given  an  increase  in  message 
fidelity  (validity),  the  influence  of  distribution  P(x)  type  on  optimum  receiver 
structure  and  properties  will  strive  towards  zero  if  the  signal  is  precisely  known 
or  has  safe  parasitic  parameters.  Therefore,  given  sufficiently-high  message  /385 
fidelity  (reliability),  it  is  possible  in  some  cases  to  assume  (in  the  first 
approximation  at  least)  that  distribution  P(x)  is  uniform.  If,  along  with  dis¬ 
tribution  P(x),  loss  function  /  (*,  v)also  is  unknown,  then  it  is  permissible  in 
some  cases,  as  noted  above,  to  use  the  maximum  likelihood  criterion  rather  than 
the  average  risk  criterion. 

Another  possible  way  is  use  of  the  minimax  criterion,  i.  e. ,  determination 
of  the  least-favorable  a  priori  distribution  P^Cx)  in  which  minimum  average  risk 
is  maximum.  However,  this  approach  also  is  not  always  advisible  because  the  actual 
distribution  may  be  significantly  more  favorable  also  due  to  mathematical  difficulties 
that  arise. 

In  binary  reception,  when  the  message  has  only  two  values,  xq,  and  x^,  in 
a  case  of  unknown  probabilities  P(xQ)  and  P(x^  si—  P(xQ),  it  is  possible  to 
avoid  characterizing  reception  quality  by  a  single  index— average  risk  R,  and 
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to  characterize  it  by  two  indices — conditional  risks  R  and  R  .  In  a  case  of 

xo  X1 

binary  signal  detection  and  uiitary  detection  error  weights,  the  result  is 
R*,  =*  Pa  t  and  /?,,  =  P np>  and  the  problem  boils  down  to  finding  the  structure  of 
a  receiver  providing  Pnv  -  min  where  Pa  T  =  const  (Neyman-Pearson  criterion). 

All  these  approaches  are  based  on  complete  disregard  for  the  a  priori  distri¬ 
bution  P(x)  type.  However,  in  several  cases,  especially  in  filtration,  this  complete 
disregard  may  be  risky,  even  given  relatively-high  message  fidelity.  Here,  in 
several  cases,  the  following  method  may  tum  out  to  be  useful. 

Let  the  requirement  be  to  extract  message  x(t)  from  additive  mixture 

y(t)=x(t)+um(t).  (21.1) 

It  was  shown  in  §  2.2  that,  when  the  optimum  system  is  sought  in  the  class 
of  linear  systems  and  the  loss  function  selected  is  quadratic,  optimum  system 
structure  and  properties  completely  are  determined  by  correlation  functions 
Rx(tlf  tg)  and  Rm  (tu  *»)* .  Consequently,  given  this  system  design  problem  formulation 
there  is  no  requirement  to  know  complete  multidimensional  distribution  P(x)  of 
random  process  x(t).  It  suffices  to  know  its  correlation  function  Rx(t^,  tg). 
Therefore,  if  mixture  y(t)  has  the  (2.1)  form  and  nothing  is  known  about  message 
x(t),  with  the  exception  of  Rx(t^,  i®  advisible  to  use  the  same  designed 

system  class  limitations  and  loss  function. 

Up  to  this  point,  we  have  examined  only  that  part  of  the  a  priori  difficulty 
linked  with  complete  or  partial  a  priori  message  distribution  uncertainty.  However, 
in  the  general  case,  parasitic  signal  parameter  and  additive  noise  distributions 
may  be  unknown  to  a  certain  degree  as  well.  Therefore,  in  more-general  form, 
the  problem  may  be  formulated  as  follows. 

In  a  case  of  complete  a  priori  information,  distribution  /386 

P(x,  y)  s  P(x)Px(y), 

*If  processes  x(t)  and  uad)  are  interrelated,  then  one  also  must  know 
R,  wOi.  W. 


required  to  compute  average  risk  R  is  precisely  known.  In  a  case  of  incomplete 
a  priori  information,  distribution  P(x)  may  be  known,  with  the  exception  of  the 
parameter  population 


v  =  (v . vmj,  (21.2) 

while  distribution  ?x(y)  is  known,  except  for  parameter  population 


P  —  (Pii  •••  •  Pil • 

This  means  that  the  only  knowns  are  conditional  distributions 


(21.3) 


P7. H (Xt  0  =  PT  W pi  M  (21.4) 

The  degree  of  parameter  V  and  [J.  uncertainty  may  differ.  Here,  the  following 
cases  may  occur: 

1.  Unknown  parameters  V  and  jJL  are  random  magnitudes  (or  processes), 
the  laws  of  distribution  of  which  P(  y  )  and  P(/X  )  are  precisely  known.* 

2.  Unknown  parameters  V  and  jJi  are  random  megiitudes  (or  processes), 
i.  e.,  distributions  P(  V  )  and  P(  fJL  )  exist;  however,  as  opposed  to  the  first 
case,  these  distributions  are  unknown. 

3.  Parameters  V  and  £4  a  fortiori  are  non- random  (determinate)  unknown 
magnitudes  (or  time  functions). 


4.  It  also  is  unknown  whether  parameters  0  and  ^4  are  random  or  not. 

In  all  these  cases,  the  assumption  is  that  the  type  of  laws  of  distribution 
P(x)  and  Px(y)  is  known;  the  only  unknowns  are  some  parameters  If  and  fJL  of 
these  laws  (expected  value  and  variance,  for  example).  However,  there  are  cases 


*If_there  is  a  statistical  link  between  V  and  ^4  ,  then  their  joint  distribution 
P(  \J  ,  jL4  )  is  known. 
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when  even  the  law  of  distribution  types  is  unknown.  These  extreme  (of  all  those 
examined  above)  cases,  in  the  sense  of  a  priori  information  incompleteness,  in 
mathematical  statistics  usually  are  called  nonparametric.  The  following  problem 
of  binary  detection  of  a  signal  on  a  noise  background  may  serve  as  an  example 
of  a  nonparametric  problem. 

The  requirement  is  to  detect  whether  or  not  a  signal  is  present  from  analysis 
of  sample  y  =  (y^,  .  .  .,  y^,  .  .  .,  y  .)  comprising  n  independent  sample  values 
y^.  Nothing  is  known  about  law  of  distribution  (integral)  F(y^),  except  that 
this  distribution  is  even  when  there  is  no  signal  and  parity  is  disrupted  /387 
when  there  is  a  signal.  As  will  be  demonstrated  in  §  21.3,  given  a  large  sample, 
n,  even  such  minimal  a  priori  information  may  suffice  for  signal  detection. 

Especially  complex  are  cases  when  it  is  unknown  whether  distributions  are 
stationary  (i.  e.,  constant  over  time)  or  non-stationary  or  when  it  is  known  that 
they  change  over  time,  but  the  law  of  this  change  a  priori  is  unknown.  In  future, 
we  will  assume  that  unknown  distributions  may  be  considered  stationary  (or  changing 
over  time  in  accordance  with  a  priori  known  lews). 

Finally,  special  note  should  be  taken  of  a  case  of  intentional  (organized) 
interference.  In  this  case,  during  receiver  system  design,  it  is  known  that  the 
enemy  will  premeditatively  change  interference  distributions,  but  it  is  unknown 
just  how  this  will  be  accomplished.  In  such  an  adversary  situation,  as  noted 
in  the  previous  chapter,  game  theory  is  the  most-adequate  mathematical  vehicle. 

Since  problem  formulation  in  game  theory  already  is  examined  above,  in  this  chapter 
we  will  examine  only  unintentional  interference. 

Preliminary  (prior  to  receiver  system  design)  receipt  of  unavailable  a  priori 
information  through  accumulation  and  processing  of  the  corresponding  experimental 
statistical  material  is  the  most-evident  and  -radical  way  to  surmount  an  a  priori 
difficulty.  If,  in  accordance  with  problem  conditions,  it  is  possible  to  take 
these  measures,  it  is  advisible  in  a  majority  of  cases  to  do  so  since  an  increase 
in  a  priori  information  about  messages,  signals,  and  noise  makes  it  possible  to 
improve  ro»«spge  fidelity.  However,  in  a  number  of  cases,  preliminary  supplementation 

of  unaval _  j  a  priori  information  is  impossible  in  principle  or  due  to  a  shortage 

of  time  and  difficulty  in  performing  experiments. 
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Another  approach  lies  in  creation  of  possibilities  for  preliminary  patterning 
of  the  receiving  device,  using  "demonstration"  methods.  Here,  during  the  pattern 
process,  not  only  some  realization  yQ(t)  is  supplied  to  receiver  input,  but, 
simultaneously,  it  is  indicated  (noted)  whether  there  is  a  signal  in  this  realization 
or  not  (in  detection  problems),  and,  if  so,  then  exactly  which  signal  (in  signal 
discrimination  problems).  In  the  general  case,  it  is  possible  to  conduct  a  demon¬ 
stration,  not  of  single  realization  yQ(t),  but  a  population  of  several  realizations, 

. ifT’Ml. 

with  simultaneous  indication  of  the  special  features  of  each  realization  (for 
example,  indication  that  there  is  absolutely  no  signal  in  realization  y{ o*(t)  , 
signal  uei(t )  will  be  contained  in  realization  y(V(t),  signal  uea(/)will  be  contained 
in  realization  y'S*  (0 ,  only  the  signal  carrier  oscillation  will  be  contained  in 
realization  y'V  (t) ,  i.  e. ,  there  is  no  modulation  by  the  measured  message,  and 
so  forth). 

Consequently,  during  patterning,  the  decision  on  message  x  reproduction  will 
be  taken  in  the  general  case  based  on  two  realization  populations,  yQ(t)  /388 
and  y(t),  where  yQ(t)  —  population  of  pattern  realizations,  y(t)  —  population 
of  operating  realizations.  Operating  population  y(t)  reaches  receiver  input  as 
the  receiver  operates  under  actual  conditions  rather  than  pattern  conditions. 
Therefore,  it  may  not  be  accompanied  by  pattern  indices. 

Evidently,  the  pattern  process  will  be  effective  only  when  there  is  confidence 
that  noise,  signal,  and  message  statistical  characteristics  remain  unchanged  during 
transition  time  from  the  pattern  to  the  actual  operating  process. Therefore, 
in  a  number  of  problems,  it  is  possible  to  require  realizations  yQ(t)  and  y(t) 
to  be  separated  over  time  by  relatively-slight  intervals  or  even  be  transmitted 
sequentially  in  parts  (or  simultaneously  if  proper  division  by  frequency  or  by 
another  feature  is  possible). 

In  radio  communications  systems,  the  patterning  idea  began  to  be  realized 
many  decades  ago  by  sending  so-called  pilot  signals,  i.  e.,  an  oscillation  not 
modulated  by  the  transmitted  message  and  close  in  frequency  to  the  operating  signal 
carrier  frequency.  Based  on  distortions  of  the  pilot  signal  reaching  receiver 
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input,  it  is  possible  to  judge  the  nature  and  intensity  of  non-additiv/e  (modulated) 
noise  arising  in  the  communications  channel. 

Since  that  time,  various  pattern  principles  to  a  certain  degree  are  being 
used  in  other  fields  of  radio  electronics  as  well.  However,  if  until  recently 
pattern  algorithm  selection  has  been  intuitive,  at  present,  statistical  decision 
theory,  the  theory  of  stochastic  approximation  and  other  methods  of  mathematical 
statistics  began  to  be  used  in  their  design.  Examples  of  such  an  approach  are 
given  in  §  21.4  and  21.5. 

However,  in  a  number  of  cases,  patterning  is  impossible  or  inadvisible,  due 
to  equipment  complexity  arising  at  this  time,  for  instance.  In  such  cases,  the 
decision  about  the  reproduced  message  must  be  taken  only  on  the  basis  of  operating 
realization  y(t)  [or  y(t)]  and  a  priori  information  incompleteness  impacts  to 
the  greatest  degree.  It  is  all  the  more  important  under  such  difficult  conditions 
to  find,  where  possible,  the  optimum  receiver  action  algorithm.  However,  solution 
of  this  problem,  as  already  noted,  will  be  found  in  the  initial  stage  and  results 
obtained  mainly  for  a  case  of  binary  signal  detection. 

At  present,  the  following  basic  optimum  receiver  system  design  methods  are 
known  for  incomplete  a  priori  information  (excluding  game  theory,  already  examined 
in  Chapter  20): 

1.  Methods  which  are  a  development  of  statistical  decision  theory. 

2.  Nonparametric  methods  of  mathematical  statistics. 

3.  Methods  based  on  the  theory  of  stochastic  approximation. 

A  brief  description  of  these  methods  is  provided  in  the  following  sections. 
21.2  Methods  Developed  From  Statistical  Decision  Theory  /389 

1.  General  Description 

Methods  developed  from  statistical  deci8ion  theory  are  used  in  cases  when 


*; 
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a  priori  distribution  P(x,  y)  =  P(x)P  (y)  is  known  (or  assumed  known),  with  the 
exception  of  some  parameter  population  (  V  ,[!)  [see  (21.2)—  (21.4)],  i.  e., 
the  only  known  is  a  conditional  distribution  of  the  (21.4)  type,  which  may  be 
designated  P~  (x,  y) ,  where  0  =  Jv,  p)  —  population  of  all  unknown  distribution 
parameters. 

If  a  priori  distribution  P(  (9) )  of  vector  9'  is  known,  then  it  is  possible 
to  find  unconditional  distribution  P(x,  y)  from  the  formula 


P(x,  </)  =  J  P~(x,  y)P(Q)dQ 


(21.5) 


(where  A~  —  range  of  all  possible  vector  Q  values)  and  to  use  it  in  the  normal 
manner  to  compute  average  risk  R. 

For  example,  relationship  (17.29)  is  valid  for  determinate  (non-randomized) 
decision  rule  (17.7)  and,  considering  (21.5),  optimum  decision  rule  FKp  iy)  may 
be  found  from  the  condition 


*=$  j  j  l(x,  T(y))P-(x,  y)P{?)dxdydO  =  mn. 


*  V  0 


(21.6) 


In  the  general  case,  when  decision  rule  A( y  /y)  may  be  randomized,  instead 
of  (21.6),  one  should  assume  that 


1111  HX'^P-£X'  y>  p  (0) 4 (Y | y) dx dy dy 


dB  -=min. 


(21.7) 


Hence,  from  the  point  of  view  of  principle,  it  follows  that  statistical  decision 
theory  easily  is  generalized  for  a  case  when  population  ^  of  the  parameters  of, 
a  priori  distribution  P(x,  y)  is  unknown,  but  does  exist,  and  distribution  P(  6  ) 
of  these  parameters  is  known.  However,  here,  computational  difficulties  may  rise 
sharply  here.  In  addition,  in  many  cases,  distribution  P("^)  is  unknown;  therefore, 
we  will  switch  to  examination  of  just  such  cases. 

One  possible  way  to  surmount  a  priori  difficulties  is  to  use  the  minimax 
method,  which  boils  down  to  the  following  in  this  case. 
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A  determination  is  made  of_the  least-favorable  type  P^(  0  )  of  this  distribution, 
i.  e.,  that  type  of  function  P(  Q  )  in  which  the  minimum  average  risk  R  value 
[determined  from  relationship  (21.6)  or  (21.7)3  is  maximum.  That  decision  /390 
rule  P (y)  [or  A(  v  /y)3  which  provides  minimum  average  risk  R  for  this  distribution 
PH(  0  )  is  optimum.  However,  mathematical  difficulties  in  determining  (computing) 
distribution  P(  0  )  in  a  number  of  cases  essentially  are  insurmountable.  In  addition, 
as  repeatedly  noted,  the  minimax  criterion  sometimes  is  too  "dangerous"  since 
the  actual  distribution  in  many  cases  may  differ  considerably  from  the  least  favorable 
one.  Therefore,  besides  the  minimax  method,  other  methods  for  overcoming  an  a 
priori  difficulty  also  are  of  interest. 

The  simplest  method  is  simply  to  assume  the  unknown  distribution  in  expression 
(21.6)  or  (21.7)  for  average  risk  is  uniform,  i.  e.,  to  consider  that 

P(0)  =  const.  (21.8) 

However,  in  a  number  of  cases,  especially  in  cases  of  filtration,  this  may  lead 
to  impermissible  system  design  errors.  Therefore,  several  authors  [160,  162, 
and  others]  propose  that  a  different  tack  be  taken,  this  being  to  require  that 
a  designed  receiver  provide  an  estimate  of  all  unknown  parameters  along  with 
message  x  reproduction.  Then,  with  respect  to  problem  conditions,  if  it  turns 
out  that  the  estimate  of  these  parameters  is  very  accurate^  there  is  a  further 
basis  on  which  to  consider  that  a  priori  distribution  P(  0  )  type  impacts  little 
on  reception  results  and,  consequently,  may  be  considered  uniform.  This  assertion 
to  a  significant  degree  is  analogous  to  the  Chapter  19  supposition  that  the  higher 
the  messageAfidelity,  the  less  distribution  P(x)  type  impact  and  all  the  more 
reason  to  consider  this  distribution  uniform. 


It  is  possible  to  explain  the  utility  of  introducing  estimation  of  unknown 
parameters  0  also  in  the  following  manner.  When  such  an  estimate  is  given, 
a  fortiori  distribution  P^(  $ )  is  computed  directly  or  indirectly  in  the  receiving 
device  and  it  is  considered  when  decision  V  is  taken  regarding  reproduced  usable 
message  x.  This  "smooths"  the  arbitrariness  that  a  priori  distribution  P(  0  ) 
is  assumed  to  be  uniform.  Evidently,  if  the  ideally-precise  estimate  of  parameters 
is  obtained*  then  the  "smoothing"  also  would  be  ideal,  i.  e.,  the  assumption 
concerning  the  uniformity  of  a  priori  distribution  P(  0  )  would  not  introduce 
any  additional  error  into  message  x  reproduction. 


However,  it  still  does  not  follow  from  this  that  replacement  of  the  simple 
assumption  about  distribution  P("$*)  uniformity  by  an  estimate  of  parameters 
always  is  advisable.  Actually,  it  follows  from  the  material  presented  in  preceding 
chapters,  for  example,  that,  when  measuring  signal  amplitude  (or  during  signal 
detection),  the  law  of  distribution  of  constant  (but  random)  signal  initial  /391 
phase  To  does  not  influence  reception  quality  if  it  is  high  (i.  e.,  signal-to-noise 
ratio  is  high).  Therefore,  during  design  of  a  receiver  system  operating  under 
such  conditions,  if  the  requirement  is  simultaneous  (with  detection)  initial  phase 
Vo  estimate,  then  this  will  lead  only  to  its  unjustified  complexity. 


But,  conversely,  it  is  evident  that,  in  a  number  of  cases,  introduction  of 
an  additional  estimate  of  unknown  signal  parameters  or  noise  may  provide  a  radical 
increase  in  reception  quality.  For  example,  when  measuring  the  amplitude  of  a 
signal  with  a  carrier  frequency  unknown  within  broad  limits  but  stable,  much  higher 
amplitude  measurement  accuracy  may  be  obtained  in  the  steady-state  mode  if  the 
receiver  performs,  in  addition,  signal  carrier  frequency  measurement  and,  conse¬ 
quently,  a  radical  narrowing  of  its  bandwidth  is  possible.  Therefore,  the  method 
of  overcoming  an  a  priori  difficulty  through  introduction  of  an  additional  estimate 
of  parasitic  signal  parameters  may  be  useful  in  a  number  of  cases.  From  the 
mathematical  point  of  view,  problem  formulation  will  boil  down  to  the  following. 

We  assume  that  the  designed  receiver  mustperform  not  only  estimate  (repro¬ 
duction)  v  of  usable  signal  x,  but  estimate  Q  of  unknown  parameters  Q  of 
distribution  P-~(x,y).  For  simplicity,  let  the  estimate  algorithms  be  determinate 
(non-randomized) ,  i.  e., 


7  =  f(y)  and  0  *  =  B(y)- 


(21.9) 


Estimate  quality  is  characterized  by  generalized  loss  function  /  (*.  9,  y.  0*),  which 
denotes  losses  with  varied  combinations  of  true  values  x  and  Q  and  of  their  estimates 
Y  and  Q  .  Then,  analogous  to  (21.6),  it  is  possible  to  write  the  following 
receiver  optimization: 


?  =  $  $  $/(*.  0-  r ({/).  B (y))  x 

*  »  « 

X  P-  (x,  y)  P  (0)  dx  du  d9  -  min. 

w 


(21.10) 
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Here,  the  minimum  is  sought  with  respect  to  all  possible  values  of  operators 
r (y)  and  B(y). 

Finding,  from  this  condition,  optimum  values  of  algorithm  (decision  rules) 

P  (y)  and  B(y)  requires  knowing  a  priori  distribution  P(  Q  ),  which  is  unknown 
to  us.  Therefore,  it  is  assumed  uniform: 

P  (0)  =  const 

(qualitative  substantiation  of  this  assumption  was  provided  above).  Here,  mathe¬ 
matically,  the  problem  solution  is  completely  determinate  and,  consequently,  possible 
in  principle.  However,  great  mathematical  difficulties  and  a  requirement  to  /39Z 
substantiate  the  complex  loss  function  /  (x,  0,  y,  0*)  type  remain. 

Therefore,  in  one  work  [162],  an  additional  assumption  is  made  for  the  sake 
of  simplicity:  the  assumption  is  that,  given  high  parameter  x  and  0  estimate 
accuracy  and  a  symmetrical  loss  function  form,  the  type  of  this  function  will 
not  play  a  role  and  one  may  assume  that  minimum  average  risk  R  [expression  (21.10)] 
coincides  with  the  maximum  with  respect  to  (  U  ,  x)  of  distribution  P~  (x,  y). 

Here,  optimum  estimate  algorithm  values  may  be  found  from  the  following,  simpler, 
condition: 

y=*„p  (y).  8*=ew(y).  (21.11) 

where  **p  (y)  and  0„p  (y)  —  values  of  parameters  x  and  for  which  probability 
density  P-  (x,  y)  ,  looked  upon  as  a  function  of  x  and  0,  has  a  maximum. 

Evidently,  this  method  is  a  generalization  of  the  method  presented  in  Chapter 
17  based  on  replacement  of  the  minimum  average  risk  criterion  by  the  maximum  a 
priori  (inverse)  probability  density  criterion.  However,  in  order  to  bring  problem 
solution  to  a  conclusion,  even  with  respect  to  the  simplified  criterion  (21.11), 
a  series  of  additional  assumptions  usually  must  be  made  (see  [162],  for  example). 

The  basic  drawback  of  the  aforementioned  methods  is  the  requirement  for  often 
insufficiently-substantiated  replacement  of  unknown  distribution  p(T>  by  a  uniform 
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distribution.  Therefore,  the  following  group  of  methods  in  which  this  assumption 
is  not  required  is  of  interest. 


The  group  of  methods  is  based  on  introduction  into  the  requirements  levied 
on  the  designed  receiver  of  the  additional  requirement  for  sliest  dependence  or 
complete  independence  of  its  structure  and/or  its  properties  on  unknown  distribution 
p<7>.  Naturally,  introduction  of  such  a  requirement  (limitation),  just  like 
any  additional  requirement,  in  the  general  case  worsens  the  achievable  value  of 
the  basic  receiver  quality  index  (it  increases  average  risk  R,  for  example). 

However,  there  is  no  arbitrariness  here  linked  with  selection  of  the  type  of  unknown 
distribution  P(  6  )  and  the  corresponding  arbitrariness  in  the  actual  basic  quality 
index  value. 

This  group  of  methods  at  present  is  more  developed  relative  to  binary  signal 
detection  and,  depending  on  the  type  of  additional  requirement  levied,  is  divided 
into  the  following  sub-groups: 

1.  Methods  based  on  the  requirement  for  invariance. 

2.  Methods  based  on  the  requirement  for  similarity. 

3.  Methods  based  on  the  false-alarm  probability  upper  bound. 

Since  false-alarm  probability  Pn r  and  miss  probability  Pn p  are  the  basic 
receiver  quality  indices  given  binary  detection  and  incomplete  a  priori  information, 
the  aforementioned  requirements  are  formulated  mathematically  in  the  following  /393 
manner: 


1.  Invariance.  Probabilities  Pn t  and  Pn p  must  not  depend  on  distribution 
P("2T )  (or  on^*). 


2.  Similarity  requirement.  Probability  Pn T  must  not  depend  on  distribution 
P(  $*)  (or  on  $  ),  i.  e. ,  given  a  change  in  P(  T>  (or~^),  this  condition  must 
be  met: 


(2i.  n) 


Pat—  const  =  P,,  T  0, 

where  Pa  to  —  given  (parmissible)  false-alarm  probability  magnitude. 

3.  Requirement  for  a  magnitude  P«  *  upper  bound.  Given  changes  in  P(  0  ) 
tor~ff )  ,  this  condition  must  be  met: 


Par  ^  Pa r  0* 


(21.13) 


It  should  be  noted  that,  when  any  of  the  aforementioned  conditions  are  met, 
so-called  impermissible  decision  rules  must  be  excluded  from  the  examination, 
i.  e. ,  those  algorithm  y  =  P(y)  types  in  which  decision  turns  out  independent 
of  input  realization  y.  [For  example,  it  is  simple  to  satisfy  the  invariance 
requirement  if  you  simply  shut  off  the  receiver*;  here,  Pa  T  =  0  and  Pap  =  1 
will  always  be  the  case  (for  any  Q  ).  It  is  evident  that  such  a  decision  rule 
is  absolutely  unsatisfactory]. 


Of  the  three  requirement  types  indicated,  the  first  is  the  most  rigid  and 
often  is  not  satisfied  at  all  (if  you  rule  out  impermissible  decision  rules), 
or  will  lead  to  a  significant  deterioration  in  quality  indices  Pa  t  and  Pnp  . 
Requirement  (21.13)  is  the  least  stringent  and,  in  a  number  of  cases,  may  be  satisfied 
with  less  quality  index  deterioration. 


In  some  particular  cases  (especially  given  a  large  independent  observation 
sample  size  n),  it  turns  out  that  invariance  algorithms,  similar  algorithms,  and 
algorithms  based  on  an  upper  magnitude  Pa  »  bound  coincide  among  themselves,  as 
well  as  with  optimum  algorithms  found  in  the  conventional  way  for  known  distribution 
p<T).  It  is  evident  that  this  occurs  in  those  cases  when  independent  a  priori 
distribution  P  (T)  type  does  not  impact  on  optimum  receiver  structure  and  properties. 
However,  in  the  general  case,  results  obtained  using  various  methods  do  not  coincide. 


*0r  there  is  no  reception  whatsoever  or  you  assert  that  a  signal  always  is 

absent. 


Several  results  V.  A.  Kora do  obtained  based  on  the  similarity  requirement  will 
be  presented  below  as  an  illustration  [163 — 167]. 

2.  An  Example  of  System  Design  of  Optimum  Detectors  Satisfying  the  Similarity 
Requirement  /394 

Consider  binary  detection  of  a  signal  on  a  background  of  gaussian  noise. 

The  assumption  is  that  intensities  ]J  and  of  signal  and  noise  are  the  only 
unknown  parameters  characterizing  distributions  PxQ(y)  and  P^(y).  Also  unknown 
is  the  ratio  of  these  intensities 


•  (21.14) 

Not  only  magnitudes  v  and  P  are  unknown,  the  laws  of  their  distribution  also 
are  unknown;  the  only  known  is  that  these  intensities  are  constant  during  detection 
time. 


When  a  system  is  designed  under  these  conditions,  a  limitation  is  imposed 
on  the  range  of  the  possible  signal-to-noise  ratio  values: 

(21.15) 

where  qQ  —  unknown  positive  magnitude.  The  requirement  for  this  magnitude  q 
limitation  below  some  non-zero  value  qQ  is  evident  since,  where  q  =  0,  the  signal 
a  fortiori  is  absent  and  the  signal  detection  problem  loses  its  meaning. 

The  first  requirement  levied  on  the  designed  system  is  similarity  requirement 
(21.12).  The  second  requirement  concerns  miss  probability  Pnp  . 

Given  unknown  v  and  p  ,  just  as  in  the  case  of  known  values  for  these 
parameters,  miss  probability  Pnp  will  depend,  not  on  absolute  v  and  JX  magnitudes 
but  only  on  their  ratio  q:  the  greater  the  q,  the  less  the  P Dp .  Here,  it  is 
desirable  to  find  that  decision  rule  r0BT(y)  which  would  provide  the  probability 
Pup  minimum  for  all  values  q  6  Aq  [where  A^  —  range  of  all  possible  parameter 
q  values  determined  from  inequality  (21.15)].  However,  in  the  general  case,  such 
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a  rule  does  not  exist.  Therefore,  a  more-modest  requirement  must  be  satisfied, 
namely  a  requirement  of  the  type 


m3kc  Pnp  {<7,  r^T)<MaKcPop(?,  T). 


(21.16) 


Here,  needed  optimum  decision  rule  Ton,  (y)  provides  the  minimum  [with  respect  to 
all  possible  decision  rules  P (y ) ]  of  the  miss  probability  maximum  [with  respect 
to  all  possible  values  of  unknown  parameter  q].  This  decision  rule  is  the  minimax 
rule  and  coincides  with  Bayes  decision  rule  r**p  (y)  for  least- favorable  a  priori 
distribution  P^tq)  [i.  e.,  of  distribution  P(q)  providing  the  maximum  of  the  minimum 
value  of  average  risk  R  determined  from  formula  (21.6).  Therefore,  it  suffices 
for  finding  rule  ronT  (y)  to.  find  the  Bayes  decision  rule  for  parameter  distribution. 
P^(q).  Here,  it  turns  out  that  the  optimum  decision  rule  has  the  following  form: 

decision  "yes"  (signal)  will  be  taken  if  /39S 


l(y)=?± 


J  P,,  .,  (y)  Pm  (?)  dq 


>C(gy); 


(21.17) 


decision  "no”  (no  signal)  conversely,  i.  e.,  when 


Here,  m  —  random  fixed  value  of  noise  intensity  |i  ;  gy  =  g(y)  —  sufficient 
[for  estimation  of  parameter  of  distribution  ]  statistic*  (more  precisely, 

*In  the  theory  of  mathematical  statistics  (see  [131],  for  example),  function 
gtyp  •  •  •»  yn)  Of  sample  value  population  (y^  .  .  .,  yn),  which  satisfies  the 

following  factorization  (special  expansion)  condition,  is  called  a  sufficient 
(for  estimation  of  probability  distribution  parameter  0  )  statistic  of  random 
process  y(t): 

po<v . y*)=M«<yi . y»).  0|A(yi . y„). 

(21.18) 

where  the  second  factor  will  not  depend  on  0  .  Using  this  relationship,  it  is 
possible  to  determine  sufficient  statistic  g(y)  corresponding  to  known  conditional 
distribution  Pa(yi.  ....  y»)  .  For  example,  if  y,,  .  .  .,  y  is  independent  and 

identically  distributed  in  accordance  with  the  Poisson  law,  then  (  s  ►,  I 

0\<«.  I  /r-nO 
p0  - - - 

n  <>>," 

i-t  I 
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realization  of  the  sufficient  statistic)  of  input  mixture  y(t);  C(g^)  —  function 
of  g^  found  in  accordance  with  similarity  requirement  (21.12)  from  the  condition 


^jtT  =  const  —P„ro 


for  all  values  of  sufficient  statistic 


V 


It  is  easy  to  become  convinced  from  comparing  the  left  side  of  inequality 
(21.17)  with  expression  (14.44)  that  it  is  a  likelihood  factor  averaged  with  respect 
to  all  possible  unknown  parameter  q  values.  Function  C(g^),  standing  on  the  right- 


Figure  21.1.  (a)  —  Computer?  (b)  —  C  [threshold  comparator]; 

(c)  —  P  [threshold];  (d)  —  Decision. 


hand  side  of  inequality  (21.17),  plays  the  role  of  some  threshold  with  which 
likelihood  factor  l(y)  value  is  compared.  Therefore,  the  optimum  detector  functional 
diagram  may  be  represented  as  shown  in  Figure  21.1.  It  differs  from  a  conventional 
Neyman — Pearson  detector  only  in  that  the  likelihood  factor  is  computed  for  least- 
favorable  (rather  than  the  actual)  distribution  PM(q),  while  threshold  bias  C 
is  not  constant,  but  is  computed  (with  the  aid  of  an  additional  channel)  for  every 
specific  realization  y(t). 


If  the  unknown  parameter  (parameters)  was  not  signal-to-noise  ratio  q  /396 
[or  the  population  of  intensities  (v.  fO  but  some  other  physical  parameter 
(parameters)  of  mixture  y(t),  then  computation  of  the  least-favorable  a  priori 
distribution  might  present  greater,  sometimes  even  insurmountable,  mathematical 
difficulties.  However,  in  the  case  under  examination,  when  this  parameter  is 
signal-to-noise  ratio  q  (or  the  population  of  intensities  v  and  M.  ),  distribution 
P^(q)  may  be  found  from  the  following  evident  considerations. 

and,  substituting  this  expression  with  the  right-hand  portion  of  relationship 
(21.18),  we  will  find  that  • 


The  1088  the  q,  the  greater  the  miss  probability  Pap  .  Therefore,  the  least 
favorable  is  that  distribution  P(q)  in  which  magnitude  q  always  equals  minimum- 
possible  value  qQ.  Consequently, 


(21.19) 


~  «- function. 


Considering  this  relationship,  algorithm  (21.17)  takes  the  form 


(21.20) 


In  many  practical  cases,  it  turns  out  that  distribution  P*,. ».  (y)  will  depend 
on  y  only  via  sufficient  statistic  g^.  In  this  case,  algorithm  (21.20)  is  simplified 


even  more  and  takes  the  form 


(21.21) 


where  C'(gy)  also  ia  some  function  of  sufficient  statistic 


It  should  be  noted  that,  in  a  majority  of  practical  cases,  there  is  no  success 


in  predetermined  substantiation  of  signal- to- noise  ratio  qQ  boundary  selection. 


Therefore,  it  often  is  more  proper  to  require,  not  minimization  of  the  ma>  mm 


miss  probability  Pap  value  occurring  when  q  s  qQ,  but  minimization  of  lower  bound 


qQ  corresponding  to  the  given  (permissible)  magnitude  Pup  •  of  miss  probability 
Pup  •  This  denotes  that,  instead  of  criterion  (21.16),  it  is  more  logical  to 
use  the  following  criterion: 


q  «  min  where  P**>  <  P„p  *  for  all  q  >  q 


(21.22) 


[retaining  here  additional  similarity  requirement  (21.12)].  Criterion  (21.22) 
will  lead  to  the  identical  (optimum)  decision  that  criterion  (21.16)  does  /397 
[while  retaining  in  both  eases  similarity  requirement  (21.12)  with  an  identical 
probability  Pa  ,  •  value]  if  the  following  correspondence  exists  between  values 


qQ  and  Pup  in  both  cease. 


mm 


Using  criterion  (21.16),  consider  the  case  where,  for  giv/en  (known)  value 
qQ,  we  found  the  minimum  value  of  the  maximum  (i.  e. ,  occurring  when  q  =  qQ)  miss 
probability  p up  m»»  and  corresponding  optimum  algorithm  Fonr  ty) .  Then,  using 
criterion  (21.22)  for  Pnp #  =  ^upMim ,  we  will  obtain  identical  decision  rule 
Font  (y)  ;  minimum  lower  bound  value  <1nHnn  corresponding  to  it  will  coincide  with 

value  qQ,  which  is  given  for  criterion  (21.16).  Here,  we  have,  in  essence,  the 
following  example  of  a  general  system  design  problem  based  on  two  quality  indices 
(this  general  problem  is  examined  in  the  next  chapter). 

When  using  both  criterion  (21.16)  and  (21.22),  there  are  two  designed  system 
quality  indices  which  it  is  desirable  to  minimize:  miss  probability  and  the 
value  of  signal- to-noise  ratio  q  lower  bound.  In  the  general  (nondegenerative) 
system  design  case,  it  is  impossible  to  select  that  decision  rule  P (y)  which 
would  minimize  P np  and  q  simultaneously  (see  Chapter  22  also).  Therefore,  in 
the  first  case  [criterion  (21.16)],  the  problem  posed  will  be  to  minimize  magnitude 
P nP  for  a  given  (fixed)  magnitude  qQ  while,  in  the  second  case  [criterion  (21.22)], 
it  is  to  minimize  q  for  a  given  value  Pn P  •  In  other  words,  in  the  first  case, 
during  the  system  design  process,  qQ  will  be  transferred  to  the  category  of  limi¬ 
tations,  while  Pnp  is  transferred  in  the  second  case.  However,  in  the  next  chapter, 
it  will  be  demonstrated  that  if,  as  a  result  of  system  design  based  on  one  of 
these  methods,  the  dependence  between  both  quality  indices  (in  this  case,  Pn P 
and  q  )  will  tum  out  to  be  strictly  monotonic,  then  the  system  design  result 
will  not  change  if  the  other  (second)  quality  index  is  transferred  to  the  category 
of  limitations  instead  of  the  given  quality  index. 

In  the  case  of  indices  q'  and  Pnp  we  are  examining,  the  dependence  between 
them  in  the  case  of  system  design  based  on  criterion  (21.16)  turns  out  to  be  strictly 
monotonic:  the  less  the  qQ,  the  greater  the  P np  and  vice  versa.  Therefore, 
from  the  point  of  view  of  principle,  criteria  (21.16)  and  (21.22)  turn  out  to 
be  equivalent.  However,  from  the  practical  point  of  view,  the  following  differences 
exist  between  these  two  criteria:  criterion  (21.16)  is  more  convenient  from  the 
point  of  view  of  the  mathematical  computations  required  to  find  optimum  algorithm 
Fi.Hfl;/)  ,  while  criterion  (21.22)  is  more  convenient  from  an  engineering  standpoint. 
Therefore,  in  similar  cases,  system  design  follows  criterion  (21.16)  and  will 
find  the  optimum  link  between  quality  indices  /’up  and  qQ,  i.  e. ,  a  performance 
curve  of  the  P np  —  /p  (q9)  type.  During  computations,  this  curve  is  used,  not  to 
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determine  magnitude  Pn p  for  a  given  qQ  (since  magnitude  qQ  usually  a  priori  is 
unknown),  but  rather  for  determination  of  magnitude  qQ  with  which  there  is  a  guarantee 
of  obtaining  the  maximum-permissible  miss  probability  P nP  » 
magnitude. 

So,  it  is  possible  (given  unknown  intensities  V  and  }Ji  )  to  use  the  opti¬ 
mization  described  by  relationships  (21.12)  and  (21.16)  and  leading  to  an  optimum 
decision  rule  of  the  (21.20)  or  (21.21)  type  for  synthesis  of  optimum  similar  /398 
algorithms  to  detect  and  plot  the  optimum  detector  performance  curve. 

Based  on  these  relationships,  in  [163  -  167]  optimum  detector  systems  were 
designed  and  their  performance  curves  for  several  specific  signal  types  were  de¬ 
termined  in  the  assumption  that  realization  y(t)  has  the  form  of  discrete  sample 
(y^,  .  .  .,  y^,  .  .  .,  yR).  Here,  as  could  be  expected,  by  virtue  of  the  increase 
is  sample  size  n,  power  losses  in  the  threshold  signal  (i.  e. ,  in  magnitude  qQ), 
obtained  given  unknown  magnitude  fJL  (compared  to  a  case  when  magnitude  JJ.  is  known), 
sharply  decrease  and,  where  n  >  23 — 50,  do  not  exceed  1  db.  Conversely,  when 
n  is  slight,  losses  may  be  very  git^at,  i.  e. ,  be  several  dozen  decibels. 

In  the  particular  case  when  the  signal  has  a  normal  law  of  distribution, 
the  Figure  21.1  circuit  takes  the  form  depicted  in  Figure  21.2.  In  this  circuit, 


Figure  21.2.  (a)  —  KB  [squarers];  (b)  —  P  (threshold  comparator]; 

(c)  —  Decision. 


B  —  linear  filter  (discrete);  KB  —  squarers;  2  __  adders  of  n  input  magnitudes; 

D^,  Dg,  and  D?  —  linear  systems  with  diagonal  transformation  matrices  (in  particular 
cases,  all  three  or  several  of  these  units  may  be  absent);  C  —  multiplier  to 
constant  number  C;  n  —  threshold  comparator:  if  ««>««  ,  then  the  decision 
that  there  is  a  signal  is  taken,  while  when  ua  <  ue  ,  then  the  no  signal  decision 
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is  taken.  In  both  circuits  (Figures  21.1  and  21.2),  the  presence  of  an  additional 
(lower)  channel  will  lead  to  an  automatic  threshold  change  (C  or  uc  )  when  noise 
intensity  changes,  which  insures,  in  particular,  that  the  requirement  for  false-alarm 
probability  constancy  is  met  [similarity  requirement  (21.12)]. 

21.3  Use  of  Nonparametric  Methods  of  Mathematical  Statistics 

It  already  was  noted  in  §  21.1  that  nonparametric  methods  are  used  in  those 
cases  when  not  only  distribution  parameters,  but  its  type  as  well,  are  rnknown. 
Nonparameteric  problems  are  complex  because,  given  an  unknown  law  of  distribution, 
in  the  general  case  it  is  impossible  in  principle  to  compute  the  magnitude  of 
the  average  risk  or  even  of  conditional  risks.  Consequently,  it  is  impossible 
to  estimate  the  quality  inherent  in  given  decision  rule  P  (y).  For  example, 
it  is  impossible  in  a  case  of  binary  signal  detection  to  compute  probabilities  /399 
P.i  t  and  P„ p  .  However,  it  turns  out  that,  given  certain  problem  formulations 
and  appropriate  decision  rule  selection,  it  is  possible  precisely  to  compute  at 
least  false-alarm  probability  P„  T ,  while  one  may  only  obtain  a  qualitative  answer 
relative  to  miss  probability  P up  .  In  particular,  one  usually  is  restricted  to 
cases  when  sample  (y^,  .  .  .,  yn)  of  n  statistically-independent  equally-distributed 
magnitudes,  while  sample  size  n  is  large  enough,  completely  characterizes  input 
realization  y(t). 

There  is  a  significant  number  of  nonparametric  methods  (see  [131],  for  example]. 
Ue  will  examine  only  several  of  them,  the  ones  which  may  be  applied  directly  to 
binary  signal  detection  problems. 

The  first  is  used  in  statistics  to  test  the  hypothesis  of  an  unknown  distribution 
parity  and  consists  of  the  following. 

Let  sample  (y^,  .  .  .,  y^,  .  .  .,  y  )  of  n  independent  and  identically- 
distributed  magnitudes  be  given.  The  requirement  is  to  test  hypothesis  H  that 
this  sample  will  belong  to  distribution  (integral)  F^(y^)  relative  to  alternative 
K  that  the  sample  belongs  to  distribution  F2(y^).  The  types  of  functions  F^(y^) 
and  ^(y^  are  unknown.  The  only  known  is  that  distribution  F^(y^)  is  even*, 

*Po8itive  and  negative  random  magnitude  values  are  equally  probable. 


i.  e. ,  it  satisfies  the  condition 


M— F.fo,),  (21.23) 

while  distribution  p£(y i)  does  not  satisfy  the  parity  condition,  i.  e., 

F*(-yt)+l~Ft(yt).  (21.24) 

The  following  procedure  may  be  used  to  obtain  acceptable  decision  rule  P  (y). 
Analog  random  magnitude  y.  is  quantized  into  binary  random  magnitude  u  given  a 
zero  threshold,  i.  e. ,  in  accordance  with  the  law 


where  yt>0 
where  ^  0. 


(21.25) 


The  sum  of  the  sample  values  then  is  formed  as  follows 

2=»^S  «(I4).  (21.26) 

Since  sample  elements  y^  are  independent  and  equally  distributed,  then  magnitude 
z  is  subordinate  to  a  binomial  law  of  distribution  with  parameters  n  and 

^  =  1  — ^(0).  (21.27) 

where  F(y^)  —  integral  law  of  distribution. 

If  hypothesis  H  is  valid,  i.  e.,  distribution  F(yi)  is  even,  then,  in  /400 
accordance  with  (21.23)  the  result  is 


p=  1-F,(0)  =  1-0.5-0.5.  (21.28) 

If  hypothesis  H  is  invalid,  then,  in  accordance  with  (21.24),  the  only  known  relative 
to  magnitude  p  is 

P  +  0, 5.  (21.29) 
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Consequently,  if  hypothesis  H  is  vslid,  then  the  magnitude  z  distribution  turns 
out  to  be  precisely  known,  in  spite  of  the  fact  that  distribution  F^(y^)  of  input 
cfata  y^  is  unknown.  This  stipulates  the  advisibility  of  preliminary  transformation 
of  input  data  (y^,  .  .  .,  yp)  into  magnitude  z  of  the  (21.26)  form. 

In  order  to  substantiate  the  algorithm  for  further  data  z  processing,  we 
will  examine  a  case  when  n  >  1  (evidently,  if  there  is  no  success  in  obtaining 
a  satisfactory  decision  given  a  large  sample  size  n,  then  it  is  all  the  npre 
impossible  to  do  so  when  n  is  small).  Where  n  ^  1,  it  is  possible  to  assume 
that  magnitude  z  has  normal  distribution,  i.  e., 

rw=7irxpl-!£sr!-  <a-*» 

It  is  not  difficult  to  determine  parameters  z  and  a*  of  this  distribution. 


Actually,  random  magiitude  u  may  have  only  values  1  and  0  with  probabilities 
p  and  (1  —  p),  respectively.  Therefore 


u«lp+0(I  —  p)=p. 


(21.31) 


But 


where  «*  =*  l*p-f  0*0  —  p)=»p. 
Consequently, 


o2*p-p*  =  p(I— p). 


(21.32) 


(21.33) 


But,  it  follows  from  (21.26)  [considering  independence  of  terms  u(y^)]  that 

z*»rt«  and  o\**nol. 


(21.34) 


Therefore,  the  result  is 


j  *=np  and  o*=np(l-p).  (21.35 

t 

i 

So,  the  decision  to  accept  (or  not  accept)  hypothesis  H  must  be  taken  based 
on  magnitude  z,  which  has  normal  distribution  (21.30)  with  parameters  determined 
from  relationships  (21.35).  If  hypothesis  H  is  valid,  i.  e. ,  distribution  F(y^) 
is  even,  then,  in  accordance  with  (21.28)  and  (21.35),  the  result  is 


(21.36 


It  turn 8  out  that  0  when  n — *  oo  and,  consequently,  it  is  possible  /401 

in  the  first  approximation  to  assume  that  z  ss  n/2.  If  hypothesis  H  is  invalid, 
then,  in  accordance  with  (21.29),  either  p<  1/2  or  p*>l/2  and,  z»z>n/ 2  or 
z  sv~z  <  n/2  ,  respectively. 


Therefore,  it  is  natural  to  select  that  algorithm  for  testing  the  correctness 
of  hypothesis  H: 


hypothesis  H  is  valid  if  magnitude  z  is  close  enough  to  n/2,  namely  if 


<‘<T 


(21.37 


Conversely,  when  macyiitude  z  exceeds  the  bounds  set  by  inequalities  (21.37),  hypo¬ 
thesis  H  is  invalid  and  must  be  rejected.  Here,  g  —  some  fixed  magnitude  (for 
example,  $  s  4),  selection  of  which  must  be  made  from  permissible  error  decision 
probabilities. 


Two  types  of  errors  are  possible  in  this  case:  rejection  of  hypothesis  H 
when  it  is  valid  and  acceptance  of  hypothesis  H  when  it  is  invalid.  Let  the 
probability  of  the  first  type  error  be  a  while  that  of  the  second  type*  be 
P  .  It  follows  from  (21.37)  that  a  is  the  probability  that  magnitude  z  will 
exceed  the  bounds  (threshold)  set  by  inequality  (21.37),  i.  e.,  will  deviate  from 


*In  mathematical  statistics,  a  is  called  criterion  level  of  significance, 
while  fi  is  called  criterion  cardinality. 


its  expected  value  (mean  value)  by  a  magnitude  exceeding  ,  where  ot  =  Yn/2 
—  magnitude  z  standard  deviation.  Since  magnitude  z  distribution  is  normal, 
then  it  is  evident  that  <*  vill  be  very  slight  already  when  I  >  4.  Magnitude 
a.  decreases  with  a  rise  in  £  ,  but  then  magnitude  J5  will  rise,  i.  e. ,  the  pro¬ 
bability  that  condition  (21.37)  is  satisfied  when  hypothesis  H  is  invalid.  Therefore, 
selected  magnitude  |  should  not  be  too  high,  but  such  that  probability  a  will 
equal  the  maximum  permissible  magnitude  a  . 

From  relationships  (21.30),  (21.35),  (21.36),  and  (21.37),  it  is  not  difficult 
to  obtain  the  following  error  probability  formulas: 


I  ? 

a  =>  1  — 2  — —  e-*v*  dx\ 
VTn  J 


*11  .Mi 

P~— x—  f  - —  ' 

V  2«  J  K'IS  J 


e-'*/2  dx. 


(21.38) 


(21.39) 


where 


/up  (I—  p) 


V  np  (1  —  p) 


It  follows  from  formula  (21.38)  that,  for  given  magnitude  of  probability  /402 
a  s  oq,  the  requisite  value  of  threshold  factor  J  may  be  determined  precisely 
from  probability  integral  tables.  Where  *  ^ ,  it  is  possible  to  assume 


]/ 2ln -i. 


(21.40) 


It  follows  from  (21.39)  that  computation  of  probability  ft  (acceptance  of 
hypothesis  H  when  it  is  invalid)  requires  know:ng  (along  with  already-selected 
magnitude  £)  parameter  p  of  a  Poisson  die  *on.  But,  in  cases  when  hypothesis 
H  is  invalid,  i.  e.,  unknown  distribution  F(>^  »  not  have  the  property  of 

parity,  parameter  p  is  unknown;  the  only  known  is  that  p  differs  from  0.5  [and 
this  difference  is  all  the  greater,  the  greater  the  distribution  F(y.)  asymmetry]. 
Therefore,  it  is  impossible  precisely  to  determine  the  probability  3  magnitude. 
However,  it  follows  from  this  formula  that,  when  p  4  1/2  and  n — vOO,  probability 
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will  strive  towards  zero.  Therefore,  one  may  assert  that  a  slight  probability 
p  value  will  be  insured,  given  a  large-enough  sample  size  n.  (Evidently,  the 
more  p  differs  from  1/2,  the  smaller  the  required  sample  size  n,  for  a  given 
magnitude). 


Thus,  a  decision  rule  based  on  relationships  (21.25),  (21.26),  and  (21.37) 
insures  (where  n  >1)  receipt  of  a  slight  probability  error  of  the  first  kind 
and  an  error  probability  of  the  second  kind  asymptotically  (where  n — *©o)  striving 
towards  zero.  Therefore,  such  a  decision  rule  may  be  considered  acceptable  under 
the  aforementioned  conditions  where  a  priori  information  is  lacking  almost  completely 

We  will  examine  the  following  problem  as  an  example  of  the  use  of  the  given 
decision  rule  for  receiver  system  design. 


Consider  binary  detection  of  a  signal  on  a  noise  background  based  on  analysis 
of  input  realization  y(t).  The  only  a  priori  data  on  the  signal  and  noise  is 
the  following. 

1.  It  is  known  that,  if  samples  (y^,  .  .  .,  y^  .  .  .,  yp)  with  interval 

A/  >  Afw„  T(where  T  —  time  allocated  for  detection)  are  taken  from  realization 
y(t) ,  then  sample  values  will  be  statistically  independent  and  have  identical 
distribution  F(y^). 

2.  Nothing  is  known  about  distribution  F(y^)  except  that,  when  there  is 
no  sicpral,  it  may  be  considered  even,  while  parity  is  disrupted  when  there  is 
a  signal.* 


Since 

(21.41) 

then  the  aforementioned  decision  rule  may  be  used.  To  do  so,  input  data  y(t) 
are  quantized  with  respect  to  time  with  intervalA/  —  A/^Cfor  formation  of  sample 


*In  other  worde,  when  there  is  no  signal,  positive  and  negative  sample  values 
y.  are  equally  probable,  while  signal  appearance  renders  them  non-equBlly  possible. 


(yx,  .  •  .,  yn)]  and  with  respect  to  magnitude  in  the  zero  threshold,  /4Q3 

resulting  in  formation  of  quantized  sample  values  u(y^).  Then,  these  data  are 
added  [in  accordance  with  law  (21.26)]  and  the  resultant  sum  is  compered  with 
thresholds 

**Mn  —  l ~~  and  rMa„c 

If  magnitude  z  does  not  exceed  these  bounds,  then  a  decision  is  taken  concerning 
parity  of  the  distribution  giving  rise  to  realization  y(t),  i.  e. ,  concerning 
signal  absence.  Conversely,  i.  e.,  when  any  threshold  is  exceeded,  the  decision 
taken  is  that  a  signal  is  present.  Here,  a  is  the  false-alarm  probability, 
while  ^9  is  miss  probability,  i.  e. , 

T.  P  =  /V  (21.42) 

Therefore,  being  given  the  permissible  probability  Pn  t  value,  it  is  possible 
to  determine  requisite  threshold  factor  |  value  from  formula  (21.38)  or  (where 
Pn  t<  IO-4  )  from  formula  (21.40).  The  decision  rule  (detector  operating  algorithm) 
selected  guarantees  the  given  slight  probability  P„  ,  magnitude  and  miss  probability 
P np  asymptotically  (where  n  — oo)  striving  towards  zero. 

We  now  will  examine  another  nonparametric  method  of  statistics  based  on  use 
of  the  congruence  criterion.  The  quantitative  measure  characterizing  the  divergence 
("disagreementn)  between  sample  value  probabilities  F^  (  )  and  (  )  is  called 
the  congruence  criterion  6(F^,  F^) .  Depending  on  the  nature  of  the  problem  posed, 
both  distributions  may  be  empirical,  or  one  of  them  may  be  empirical  and  the  other 
hypothet ical  ( approx imat e) . 

We  will  examine  the  problem  of  testing  hypothesis  H  concerning  the  affiliation 
of  two  samples  (x^,  ...»  xn)  and  (y^,  ...»  y^  with  independent  elements  with 
the  same  probability  distribution  against  alternative  hypothesis  K  that  these 
samples  are  affiliated  with  different  distributions. 

Empirical  distributions  F^Cz)  and  Fy(z)  built  from  samples  (x^,  .  .  . ,  xn) 
and  (y1f  .  .  .,  y  ),  respectively,  play  the  role  of  distributions  F.  and  F2  in 


this  case.  By  definition,  the  empirical  function  of  a  distribution  corresponding 
to  sample  (».,,  .  .  xn>  eqials 


F  (x) 


(21.43) 


where  I'^Cx)  —  number  of  sample  values  not  exceeding  some  threshold  x.  For  com¬ 
putation  of  y  (x),  sample  values  (x^,  .  .  .,  xn)  are  presorted  into  ascending 
magnitudes,  i.  e.,  in  the  form  of  a  so-called  variational  series. 


The  empirical  funtion  of  distribution  F(y)  corresponding  to  sample  (y^,  . 


.  .,  yn)  is  determined  in  an  analogous  manner: 


(21.44) 


Fx(z)~F(x),.„  | 
F,(z)-F(y),-'.  I 


(21.45) 


So,  to  teat  hypothesis  H  that  distributions  F  (z)  and  F  (z)  coincide  against 

x  y 

alternative  K  that  they  do  not  coincide  requires  preliminary  selection  of  the 
congruence  criterion  type,  i.  e.,  the  type  of  function  (or  functional)  A(F  ,  F  ) 

y 

characterizing  the  degree  of  divergence  between  F  and  F  .  Different  congruence 

x  y 

criteria,  each  having  inherent  advantages  and  disadvantages,  correspond  to  the 
different  types  of  this  function  (functional).  We  will  examine  as  our  example 
two  congruence  criterion  types: 


The  first  congruence  criterion  has  the  form 


A -max  | Fx(z)-F,(z)l. 


(21.46) 


As  N.  V.  Smirnov  demonstrated  [131],  when  n-+oo 


pj(±+-L)-,/2A>rj-2  *> 0,  (21.47) 
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where  P  (  )  —  probability  of  the  event  enclosed  in  brackets.  The  decision  rule 
may  be  selected  in  the  following  form:  hypothesis  H  is  true  [samples  x^,  .  .  . , 
xn)  and  (y^,  .  .  .,  yn)  will  be  affiliated  with  the  same  distribution]  if 


(21.48) 


If  inequality  (21.48)  is  not  satisfied,  then  hypothesis  H  is  untrue,  i.  e. ,  alter¬ 
native  hypothesis  K  is  valid.  Here,  A^  —  some  threshold  selected  from  the  per¬ 
missible  value  of  probability  C(  of  rejecting  hypothesis  H  when  it  is  true. 


It  is  possible  on  the  basis  of  relationships  (21.47)  and  (21.48)  to  find 
the  link  between  probability  CC  and  threshold  A ^  ,  making  it  possible  to  select 
magnitude  A^  in  accordance  with  the  given  (permissible)  probability  CC  value. 
Actually,  it  follows  from  determination  of  probability  CX  that 


a  =  P{4>A«).  (21.49) 

given  that  hypothesis  H  is  true.  But,  if  hypothesis  H  is  true,  then  the  distribution 
of  magnitude  A  is  subordinate  to  law  (21.47).  Therefore,  formula  (21.^7)  may 
be  used  for  determination  of  probability «  =  P  |A>  A«| .  Evidently,  it  is  possible 
to  write  this  formula  in  the  following  way  as  well: 

Substituting  this  expression  into  (21.49),  we  obtain 

Given  slight  values  of  C£  ,  it  is  possible  to  restrict  ourselves  to  the  /405 
first  term  of  the  sum,  i.  e.,  to  assume 
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hence,  the  result  is 


Another  congruence  criterion  (named  the  Uilcoxon  criterion  for  its  author) 

is  based  on  a  count  of  the  number  of  inversions.  In  the  problem  formulated  above, 

when  this  criterion  is  used,  both  samples  (xlf  .  .  .,  x  )  and  (y1t  .  .  .,  y  ) 

jl  n  x  /fi 

are  in  the  form  of  one  (common)  variational  series,  for  example: 

xt>  xt>  Ht>  xt,  I/,*  xt, ... 


If,  in  this  series,  k  elements  of  sample  (y. ,  .  .  . ,  y  )  precede  given  x/  then 

i  * 

k  inversions  occur.  The  total  number  of  inversions  A  equals  the  sum  of  the  inversions 
formed  by  all  elements  of  the  first  sample  with  the  elements  of  the  second. 


Where  m  +  n  >20  and  m  >  3,  it  is  possible  approximately  to  consider  that 
magnitude  A  has  normal  distribution  with  parameters 

oi=  -22(m+/t  +  l).  (21.50) 

The  decision  rule  has  the  following  form 

if  (21.51) 


then  hypothesis  H  is  accepted,  while,  conversely,  hypothesis  K  is  accepted. 


It  is  possible  to  use  relationships  (21.50)  and  (21.51)  to  find  the  link 
between  probability  Ct  of  hypothesis  H  rejection  when  it  is  true  and  threshold 

Aa' 


(21.52) 


where  x^  is  determined  from  relationship 


_J _ f  *  =  «• 

V B  J 

*a 
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(21.53) 


When  (X  <  10  ,  it  is  possible  to  assume  that 


x<*  &  ]/^2  In  — . 


(a.  54) 


As  is  evident  from  the  examples  presented,  congruence  criteria  make  it  possible 
(given  large  enough  samples)  precisely  to  determine  probability  C£  of  an  error 
of  the  first  type  (rejection  of  hypothesis  H  when  it  is  true),  but  do  not  /406 
make  it  possible  to  determine  probability  of  an  error  of  the  second  type  (accept¬ 
ance  of  hypothesis  H  when  it  is  untrue). 

It  is  possible  to  solve,  for  example,  the  following  problem  of  detecting 
a  signal  on  a  noise  background  using  the  aforementioned  methods  based  on  the  con¬ 
gruence  criterion. 

The  requirement  is  to  establish  whether  or  not  a  signal  is  present  in  realization 
y,n(0  given  the  presence  of  additional  (pattern)  realization  y(2)w  .  it  is 
known  that  there  cannot  be  a  signal  in  realization  y<2)U)  and  the  statistical 
characteristics  of  the  noise  in  realizations  and  yli)(t)  are  identical. 

The  duration  of  the  first  realization  is  T^,  while  the  duration  of  the  signal 
appearing  in  it  also  is  T^.  The  duration  of  the  second  realization  is  T^.  Durations 
T^  and  T^  are  so  long  that  it  is  possible  to  obtain  from  realizations  and 

yi2){i)  n  and  m  independent  sample  values,  respectively,  where 


and  m»l. 


(21.55) 


The  fact  that  the  laws  of  distribution  of  the  sample  values  within  each  reali¬ 
zation  are  identical  is  assumed  to  be  known.  Here,  uniform  (integral)  distribution 
Fj(z)  completely  characterizes  the  law  of  distribution  of  the  first  sample,  while 
uniform  distribution  Fg(z)  completely  characterizes  the  second  sample.  If  there 
is  no  signal  in  the  first  sample,  then  distribution  F^(z)  coincides  with  distribution 
F2(z).  Appearance  of  a  signal  (in  the  first  sample)  causes  F^(z)  to  deviate  from 
F2(z). 

Consequently,  testing  the  hypothesis. concerning  the  coincidence  of  distributions 
Ft  and  F2  is  fully  equivalent  to  testing  the  hypothesis  concerning  signal  absence. 
Hypothesis  K  concerning  signal  presence  is  the  alternative  hypothesis.  Hence, 


it  followe  that,  under  the  given  conditions,  any  of  the  congruence  criteria  described 
above  nay  be  used.  Here,  samples  obtained  from  realizations  yin  U)  and  y<2»  (0 
.pteytherole  of  samples  (xj,  .  *  ,,  xfl)  and  (y^,  .  .  ym),  respectively,  and 
CC  and  /J  are  falae-alarm  and  signal  miss  probabilities,  respectively  . 

These  examples  clearly  illustrate  the  following  special  features  of  nonparametric 
methods  of  statistics.  These  methods  make  it  possible  to  choose  a  receiver  operating 
algorithm  insuring  precise  equality  of  fttlem-alarm  probability  P«  T  of  a  give  n 
(permissible)  magnitude,  even  for  very-slight  s  priori  information  concerning 
the  less  of  signal- plus-noise  distribution.  However,  the  paucity  of  this  information 
leads  to  the  fact  that  there  is  no  success  in.  determining  the  precise  miss  probability 
P»p  value;  in  the  best  case,  it  is  possible  to  Obtain  only  a  qualitative  guarantee 
of  the  smellnese  of  this  probability  , when  n~ — *“  ®o.  In  addition,  the  aforementioned 
nonparametric  methods  have  the  following  serious  deficiencies: 

*•  m  ,  • ,  .  i  "?C-  '  ..  ,vt 

1*  There  is  a  requirement  for  a  large  number  of  ^afeistically-independent 
sample  values,  which  lssds  to  the  necessity  to  select  a  large  enough  time  interval 
between  acQeceat  reelization  sample  values  and,  correspondingly,  to  lose  some 
of  the  usable  information  contained  in  these  realizations. 

Z,  Each  of  the  described  me^hodc  ia  valid  only  for  a  relatively-narrow  class  . 
of  problems.  For  example,  there  lathe  requirement  that  the  low  of  distribution  /407 
be  even  whan  there  is  no  signal  or  that  an  additional  ("pattern”)  sample  exist, 
which  does  not  a  fortiori  contain  a  signal.  In  addition,  there  is  a  requirement 
that  the  law  of  distribution  of  oil  sample  values  (within  a  given  sample)  be 
identical..' 


5.  In  the  general  c»oe,  th«e|r  ie  success  only  in  finding  acceptable  algorithms 
r  (y),  rather  than  the  beet-poseibl e  ones  (within  the  framework  of  the  given  quality 
criterion). 

However,  the  theory  of  nonparametric  methods  of  statistics  is  in  a  state 
of  rapid  development  and  it  may  turn  out  to  be  useful  for  a  number  of  problame 
with  very  alight  e  priori  information. 


21.4  Use  of  the  Theory  of  Stochastic  Approximation 


Robbins  and  Monro^for  the  first  time  proposed  a  method  of  stochastic  approx¬ 
imation  to  find  the  estimate  of  the  root  of  a  regression  equation. 


Regression  is  one  of  the  characteristics  of  the  statistical  link  between 
two  random  magnitudes  x  and  y  and,  by  definition,  equals  the  expected  value  of 
one  of  these  magnitudes,  given  a  fixed  value  of  the  other,  i.  e. ,  M  (y)  —  reg^ssion 
of  y  to  x;  My(x)  —  regression  of  x  to  y. 

Dependence  M  (y)  =  m(x)  (or  dependence  of  M  (x)  on  y)  is  called  a  regression 
x  y 

function  (curve),  while  the  equation 


m(x)  =  0  (21.56) 

is  a  regression  equation.  Function  m(x)  has  a  very-valuable  property:  if  one 

* 

accepts  as  the  estimate  y  of  dependence  y  on  x  functional  dependence  of  the  type 

»*="(*)  (21.57) 

then  the  standard  deviation  of  y  from  y  will  be  minimal. 

Robbins  and  Monro  proposed  an  iterative  procedure  of  so-called  stochastic 
approximation  for  finding  estimate  xQ  of  root  xq  of  regression  equation  (21.56) 
given  sample  (y^,  .  .  . ,  y^,  .  .  .,  yn)  with  elements  having  some  conditional 
distribution  Px(y)<)  [in  the  supposition  that  equation  (21.56)  has  a  total  of  one 
root].  Here,  functions  P  (y. )  and  m(x)  are  considered  unknown  so  that  estimate 

♦  X  K 

xQ  of  the  magnitude  of  the  equation  (21.56)  root  must  be  given  only  based  on 
sample  (y^,  .  .  .,  yk,  .  .  .,  yn)  for  each  element  y^  of  which 

Mx.  («/*)=/»(**)• 

* 

Estimate  xQ  of  the  needed  root  will  be  found  by  means  of  the  n-th  use  of 
recurrent  relationship 


*axl+akyk, 


(21.58) 


where  k  =  1,  .  .  .,  n.  Here,  a  random  constant  number  may  be  selected  as  x^ 

=  Xj. 

If  coefficients  a^  satisfy  the  conditions+  /408 


S®**00*  2  (21.59) 

*— i  i 


then,  given  an  unrestricted  increase  in  sample  size  (n  — estimate  x*  +  . 
will  diverge  from  the  probability  of  unity  to  needed  root  xq,  i.  e., 


’( limx« 

\  n-*m 


Xn+  I  —  ■Koj  —  1  • 


(21.60) 


Conditions  (21.59)  are  satisfied  in  particular  if 

an^Vn.  (21.61) 

Later,  Kiefer  and  Uolfowitz  (175]  proposed  a  corresponding  procedure  of 
stochastic  approximation  for  finding  the  extremum  of  regression  function  m(x) 

(in  the  supposition  that  there  is  only  one  extremum). 

Ya.  Z.  Tsypkin  in  his  works  [132,  172,  198]  demonstrated  that  stochastic 
approximation  algorithms  successfully  may  be  used  in  design  of  various  information 
retrieval  systems  and  control  systems.  We  will  shift  to  a  brief  exposition  of 
the  main  content  of  Tsypkin's  work  relative  to  radio  reception  system  design  problems 


Let  the  following  population  be  subjected  to  system  design  (optimum  selection) 

S  =  (c,,  Cft ...»  c/f)  ( 21.62) 

of  receiver  parameters  c^,  c ^  .  .  .,  c^,  while  this  receiver's  quality  index 
may  be  represented  in  the  form 

?  =  ?(C)=j  Q(y,C)P(y)dy~M{Q(j/tC)},  (21>63) 

+In  addition  to  restrictions  (21.59)  on  coefficients  a^,  some  (usually  insig¬ 
nificant)  restrictions  also  are  placed  on  the  values  of  conditional  (for  a  given 
x)  expected  value  and  variance  of  random  variable  y  (see  [195],  for  example). 
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where  function  Q(y,  2)  considers  the  dependence  of  the  quality  index  both  on  vector 
"?  and  on  input  realization  y  =  y(t).  [In  the  case  of  discrete  sample  y  =  (y^, 

•  .  .»  yn)3. 

Optimum  vector  C  value  CHV  is  considered  that  value  to  which  the  magnitude 
%  extremum  (maximum  or  minimum,  depending  on  the  type  of  problem  being  solved) 
corresponds.  For  simplicity,  in  future  (but  without  disrupting  the  generality 
of  the  results)  it  is  possible  to  assume  that  the  minimum  is  this  extremum. 

There  is  success  in  reducing  the  majority  of  quality  indices  used  at  the 
present  time  to  the  (21.63)  form,  particularly  average  risk  R.  If  function  /409 
y  (O  is  differentiated  for  C,  then  the  requisite  condition  of  the  magnitude 
ty  extremum  has  the  following  form: 


V?  (Q  =  o, 


(21.64) 


where 


. 


(21.65) 


—  gradient  of  vector-?.  It  follows  from  (21.63)  and  (21.64)  that  the  extremum 
condition  also  may  be  written: 


M(veQ(y.  P(=0.  (21.66) 

where 

V.«(*9-!|2 . It;)  (21.67) 

I  *  * 

—  gradient  of  vector  Q(y,~c}  with  respect  to-?. 

Vector  C  optimum  value  C ,  i.  e.,  the  value  aatisfying  equation  (21.66), 
may  be  found  from  the  following  iterative  algorithm,  which  ia  one  of  the  variations 
of  the  stochastic  approximation  algorithm: 


C[nl-<?M-H  Cln-  IJ). 


(21.68) 


where  *ctn]  and  C[n  —  1]  —  needed  vector  values  in  the  n-th  and  (n  —  l)-th 
approximation  step.  Initial  vector  C  valueTtl]  may  be  random;  A[n]  —  diagonal 
N  x  N  matrix  of  the  type 


A[n]  = 


ai  (a].  0 . 0 

0,  a, [/I). ....  0 


0,  0, ....  0,  aN  t«l 


(21.69) 


Simpler  is  the  particular  case  when,  instead  of  (21.69),  the  following  is  selected 


A  In]  =  la[n]. 


(21.70) 


where  1  —  identity  matrix. 


Evidently,  case  (21.69)  differs  from  (21.70)  in  that  the  values  of  coefficients 
a^[n],  .  .  .,  a^[n]  selected  for  different  vector  C  components  c^,  .  .  . ,  cN  are 


not  identical. 


Algorithm  (21.68)  differs  from  the  conventional  algorithm  of  an  extremum 
gradient  search  in  that  gradient  VeQ  will  depend  on  random  variable  y  and,  con¬ 
sequently,  is  random.  Therefore,  special  substantiation  is  required  for  the  con¬ 
vergence  (when  n  - *oo)  0f  magnitude  C[n]  to  the  equation  (21.66)  root,  i.  e., 

to  extreme  value  C*p  . 

Analogous  to  the  case  where  coefficient  an  selection  in  accordance  with  /410 
condition  (21.59)  is  required  to  insure  convergence  of  algorithm  (21.58)  in  a 
Robbins  and  Monro  problem,  corresponding  selection  of  coefficients  a^[n]  of  matrix 
A[n]  is  required  for  algorithm  (21.68)  convergence.*  But,  when  solving  practical 
problems,  there  is  a  requirement  that  not  only  the  algorithm  will  converge  when 

n  - *»,  but  that  it  will  do  so  rapidly  enough*  Therefore,  substantiation  of 

the  best  selection  of  the  type  of  matrix  A[n]  or  [in  the  case  of  (21.70)]  dependence 


*In  addition,  several  additional  conditions  analogous  to  those  indicated  in 
the  page  584  note  must  be  satisfied. 


as 


a[n]  is  the  central  task  in  the  theory  of  stochastic  approximation.  This  task 
still  is  far  from  satisfactory  solution,  but  some  useful  results  already  have 
been  obtained.  For  example,  sufficient  conditions  of  convergence  with  a  probability 
of  unity  for  algorithm  (21.68)  [for  the  (21.70)  case]  is  presented  in  [132,  page 
72].  They  include  inequalities  (21.59  and  several  additional  conditions. 

So,  if  coefficient  A[n]  in  algorithm  (21.68)  satisfies  the  convergence  con¬ 
ditions  ,  then  this  algorithm  makes  it  possible,  when  n  — co,  to  obtain  optimum 
value  C„ p  of  the  system  parameter  vector.  In  future,  we  will  assume  for  simplicity 
that  A[n]  is  selected  in  accordance  with  (21.70),  i.  e., 

CI«]  -  C  In  - 1 J  -la  [n]  V  e  Q  (y  [nj.  C  [n  - 1 J),  (21.71) 

where  C[l]  —  random  vector-?  initial  value. 

In  a  number  of  problems,  there  is  no  success  in  computing  gradient 
Q  III*  Q  [because  of  the  nondifferentiability  of  function  Q(y,  C)  with  respect 
to  C,  for  example].  In  such  cases,  gradient  Q  may  be  found  (measured)  approx¬ 
imately,  for  example,  using  the  following  retrieval  method. 

We  assume  that 


Ve  Q(y,Q&  VcQ  (!/•  c>  t’)< 


(21.72) 


where 


VcQU/’  C,  b)- 


Q.  (y,  C,  b)-Q_(y,C,b) 
- 25 - 


<?+  ( y ,  C,  b)  -  (Q  (y,  C  +  bej . Q  («/.  C  -I-  Oe.v) ) . 

Q-  ( y ,  C,  b)  =  {<?  {y,  C—  bet) . Q{y,C — fce,v)}  • 


(21.73) 
All 

(21.74) 


Here,  b  —  scalar,  e^i  :  1,  .  .  .,  N)  —  basis  vectors.  In  the  simplest  case 


e ,  —  ( ! ,  0, ....  0);  **=(0,  I,  0, ...»  0) .  e,v  =  (0,  0, ....  0,  1). 


(21.75) 


Magnitude  Q  Uj<  C,  b)  may  be  determined,  for  example,  with  the  aid  of  a 
synchronous  detector  (Figure  21.3)  in  which  reference  voltage  has  the  form  of 


Of t,T+*slgn(sin<Jo  tj] 


(a) 


X 


-Eh 


bVeQ(z£6) 


sign  (sin  ufftj 


Figure  21.3.  (a)  —  F  [filter]. 


a  rectangular  identity  function  sign  (sin  «•»#/).  .  Filter  (p  at  synchronous  detector 

output  suppresses  rf  components.  Here,  the  structural  schematic  of  a  discrete 
servomechanism  realizing  algorithm  (21.71)  [in  which  gradient  ^cQ{i/,C) is  computed 


Figure  21.4.  (a)  —  F  [filter]?  (b)  —  DI  [discrete  integrator]. 

from  formula  (21.72)]  has  the  form  depicted  in  Figure  21.4.  Vector  links  are 
designated  by  double  lines;  and  K2  —  commutators;  *P  —  filter;  JW  —  discrete 
integrator  (digrator),  which  converts  difference  &C[n  —  1]  s  C[n]  — "ctn  —  1] 
into'Stn  —  1].  It  will  comprise  delay  element  (by  one  step)  33  (Figure  21.5) 
enclosed  by  unitary  positive  feedback. 


9 

I 

■ 

Figure  21.5.  (a)  —  EZ  [delay  element]. 


When  the  retrieval  method  is  used  to  determine  the  gradient,  i.  e.,  in  accordance 
with  expressions  (21.72)— (21.74),  optimum  vector  C  value  computational  algorithm 
(21.71)  takes  the  following  form: 

C  l«)  =  C  \n  - 1 )  -  la  t«)  V,  Q  (y  l«).  Cl«  -  1 1).  (21.76) 

In  a  number  of  cases,  instead  of  discrete  algorithm  (21.71),  it  is  more  /412 
convenient  to  use  an  analog  algorithm  of  the  type 

d££L=—a (/)  VeQ (y (0>  C (/)).  (21.77) 

dt 

When  n - solution  of  this  equation  will  strive  towards  optimum  value 

Cnp  of  the  system  parameter  vector,  given  any  initial  value  of  vector'?  =  C(0). 
Coefficient  a(t)  is  a  continuous  analog  of  discrete  function  a[n]  and  must  be 
selected  from  the  condition  of  the  fastest  vector  ?(t)  convergence  to  C*p  .  The 
discrete  algorithm  is  convenient  for  realization  in  digital  computers,  while  the 
analog  algorithm  is  convenient  for  analog  computers. 

We  will  examine  the  following  two  examples  of  the  multitude  presented  Airing 
explanation  of  the  methodology  from  [132,  196]. 

Example  1.  Use  in  design  of  a  Wiener  optimum  linear  filter  system. 

The  problem  of  linear  filtration  in  the  Wiener  formulation  was  examined  in 
§  2.2.  It  will  consist  of  the  determination  of  impulse  response  »i  (t)  of  a  linear 
stationary  filter  providing  the  minimum  message  reproduction  mean-square  error, 
i.  e.,  the  minimum  of  magnitude 


? -AMMO- 1(01*. 

where  h(t)  s«^"(p)uc(t)  —  desired  effect  at  filter  output,  while 

m 

T)T|(T)rfT 

0 

—  actual  oscillation  at  filter  output) 


(21.78) 


(21.79) 
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y  (<)  =  “e  (O  +  Uui  (<) 


—  sum  of  random  signal  and  noise  at  filter  input. 

Formula  (21.79)  applies  to  a  case  when  signal- plus-noise  y(t)  reaches  filter 

input  at  moment  t  =  —  If  t  =  0,  then,  instead  of  (21.79),  one  should 

o  o 

assume  that 


— T)t|(T)rfT. 

0 


(21.79a) 


Here,  formula  (21.78)  takes  the  following  form: 


7 -All*  (*)  y  (<-T)dT]». 

0 


(21.80) 


It  was  noted  in  §  2.2  that  finding  the  optimum  impulse  response  lonr  (Q  providing 
the  minimum  of  functional  7  requires  knowledge  of  signal  and  noise  correlation 
functions.  If  these  functions  are  unknown,  then  it  is  possible  to  use  the  stochastic 
approximation  methodology  presented  above.  To  do  so,  the  needed  impulse  reaction 
is  represented  in  the  form 


*1  (0-Ci Vi  (0+c,  Vi (0+  •■•+cn  9w(0. 


(21.81) 


where  v^(t),  .  .  .,  $  N(t)  —  known  (preselected)  linearly-in dependent  time 

functions.  Here,  the  problem  of  finding  optimum  impulse  response  (t)  boils 
down  to  finding  optimum  parameter  vector  C. 


Expression  (21.81)  may  be  written  in  abbreviated  form: 


ij«)-CrvU), 


(21.82) 


~cr  —  transposed  vector  C  (vector-line),  while  <p(t)  —  N- dimensional  vector, 
whose  components  are  function  ^(t),  .  ,  .,  <r>N(t).  Substituting  (21.82)  into 
(21.80),  we  obtain 


(21.83) 


♦  <0- —  X)dX. 


(21.84) 


Comparing  expressions  (21.83)  and  (21.84)  with  (21.63),  it  is  easy  to  become 
convinced  that,  in  this  case,  the  role  of  function  Q(y, "?)  is  played  by  function 


Q  (y .  Cj  -  [a  (0  -ZT ♦  «)1 *  - 1  *  (0  -C r  1 9  (t)  *(«-*)  «h  j ! 


(21.85) 


Therefore,  it  is  possible  to  use  algorithm  (21.77)  to  find  optimum  vector"tT value 

Cup  ,  i.  e.,  to  find  C*p  as  a  steady-state  (where  n - ►  oo)  value  of  solution 

C(t)  of  the  differential  equation 


at 


But,  it  follows  from  (21.67),  (21.84),  and  (21.85)  that 


(21.86) 


Vc  Q  (0 .  C  (0)  -  -2  [a  (<)-Cr  ♦(/))  ♦(/). 


Therefore,  algorithm  (21.86)  has  the  following  form: 


.^!=2a «)[*«) -Cr?(o]  ♦  (<>• 

As  noted  above,  the  solution  of  this  equation  will  converge  (when  t  - 
needed  optimum  value  C*p  for  any  value  of  initial  condition ”?(0) 


(21.87) 


'OO)  towards 


It  follows  from  relationship  (21.87)  that  it  is  necessary  to  assume  desired 
filter  output  reaction  h(t)  =  (p)  u,(t)  to  determine  optimum  value  c„p  of  parameter 

vector  But,  to  do  so,  one  must  have  a  realization  of  signal  u  (t)  forming 

C 

part  of  signal-plus-noise  y(t).  In  the  filter  operating  mode,  it  is  impossible 
in  principle  to  provide  this.  Therefore,  when  algorithm  (21.87)  is  used,  it  is 
necessary  to  supply  to  filter  input  special  pattern  realizations 

M(0*«ep(0  +  «a»(0. 


which  allow  shaping  of  desired  output  realizations 


*( t)-3>  Ip)  «c»(0. 


(21.88) 


where  *Mp)  —  known  (given)  linear  operator. 

After  the  pattern  process  is  completed,  i.  e. ,  the  estimate  of  the  optimum 

parameter  vector  CHp  value  is  found,  the  designed  filter  may  be  switched  to  the 

operating  mode,  in  which  operating  realizations  y(t),  which  contain  unknown  /414 

signal  u  (t),  reach  its  input.  Strictly  speaking,  C  will  converge  to  C„p  only 
c 

when  t - ►  «>,  i.  e. ,  the  pattern  process  must  last  an  infinitely-long  time. 

However,  if  the  vector  C(t)  convergence  to  CKp  is  good  enough  (fast  enough)*  then 
it  is  possible  to  assume,  with  accuracy  satisfactory  for  practical  purposes,  that 
pattern  time  is  finite. 

To  illustrate  these  suppositions,  we  will  examine  the  simplest  case  when 
a  filter  impulse  response  is  sought  in  the  form 


t|«)-ctei  (0- 


(21.89) 


This  denotes  that  the  impulse  response  shape  is  considered  given  and  unknown, 
i.  e. ,  only  scale  factor  c^  is  subject  to  system  design. 

Here,  vector  differential  equation  (21.87)  is  converted  into  a  conventional 
(scalar)  equation 


— =2b(0  (A<0-c»tMhl*i<0. 
where 

i 

f  i  (0—  J  W  y*  (< — t)  dt; 

o 

4- ««#(/)• 


(21.90) 


The  Figure  21.6  structural  schematic  corresponds  to  these  expressions.  In 
this  figure,  0^  —  linear  filter  with  impulse  response  ^(t);  *P2  ~  linear 
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filter  with  given  operator  transfer  constant  ®(p);  H  —  integrator;  ni . 

—  switches  from  the  pattern  mode  (position  1)  to  the  operating  mode  (position  2). 


Determinate  functions  ^(t)  and  a(t)  are  assumed  to  be  selected  prior  to  beginning 
the  pattern;  uCQ,  uc(t),  (0  »  and  ua  (/)  —  stationary  random  processes  (with 
zero  mean  values).  The  statistical  characteristics  (correlation  functions)  of 
th**se  processes  may  be  unknown;  it  is  important  only  that  these  characteristics 
do  not  change  during  the  transition  from  «»•  (0  and  uct£ft°  (0  and  uc(t). 

During  the  pattern  process,  realizations  yQ(t)  of  the  pattern  random  process 
reach  filter  input.  Usable  signal  realizations  ucQ(t)  included  in  this  process 
are  known  here  and  are  used  simultaneously  to  form  the  desired  effect  h(t)  at 
filter  output.  Evidently,  function  h(t)  plays  the  role  of  "teacher:”  it  "indicates" 
to  the  filter  which  realization  'J (t)  must  be  at  output  if  there  is  no  noise. 

During  the  pattern  process,  needed  parameter  c^  is  a  time  function  and,  in  /415 

the  steady-state  mode  (where  n - ►  oo),  it  must  equal  constant  magnitude  cup. 

Therefore,  the  moment  function  c^(t)  achieves  an  approximately  stationary  value 
(with  error  not  exceeding  1%,  for  example)  may  serve  as  the  criterion  for  practical 
cessation  of  the  pattern  process. 

Upon  cessation  of  the  pattern  process,  the  designed  filter's  impulse  response 
must  equal  [in  accordance  with  (21.89)] 


n(/)»e,«Pi(0. 


(a.  91) 


where  et »  ct  KP  . 


Since  filter 0^  (Figure  21.6)  has  impulse  response  ei^(t),  section  AB  of 
the  Figure  21.6  circuit  has  (upon  cessation  of  the  pattern)  a  (Z1.91)-type  impulse 
response.  Consequently,  in  the  operating  node,  the  circuit  section  comprising 
filter  0^  and  the  constant  factor  c^  multiplier  must  play  the  role  of  needed 
optimum  linear  filter.  Therefore,  in  the  operating  mode,  realizations  y(t)  are 
supplied  to  filter  0^  input,  while  result  T'(t)  of  the  reproduction  of  signal 
uc(t)  is  picked  off  the  multiplier  output  at  point  B. 

Example  2.  Use  in  signal  detector  system  design. 

Supplied  to  detector  input  is  noise  «m(0  or  signal-plus-noise 


y(/)-Ue«;  a;...;  am)  +  tt|ll(0.  (21.92) 

Let  the  following  composite  error  probability  be  the  optimized  quality  index 

Pod.=P(0)/>aT+P(C)/>np.  (21.93) 

i.  e.,  a  priori  probabilities  P(0)  and  P(C)  of  signal  absence  and  presence  are 
known.  However,  the  statistical  characteristics  of  the  signal  (when  it  is  present) 

and  the  noise  are  unknown.  Therefore,  conditional  distributions  P  (y)  and  P  (y)  also 

.  o  *1 

are  unknown. 

Under  these  conditions,  given  the  presence  of  pattern  realizations  yQ(t), 
optimum  detector  system  design  is  possible  based  on  pattern  algorithm  (21.68). 

It  order  for  it  to  be  used,  quality  index  (21.93)  must  be  reduced  to  the  (21.63) 
form.  This  may  be  done  by  considering  the  following  special  binary  detection 
features  examined  in  §  14.3. 

During  binary  detection,  the  decision  rule  may  be  written  in  the  following 

form: 


i.  if  vC A; 

T«r(y)=  (21.94) 

'  0.  if  yfB, 


where  unity  denotee  a  "yea"  decision  (signal)  and  zero  a  "no"  decision  (no  signal); 


y  *  y(t)  —  input  realization  obaerved  during  time  T.  The  requisite  (ideal)  decisions 
here  are: 


i.  if  (/)  iO, 
0.  if  tic  <0  ?  «> 


(21.95) 


(here  uc(t)  ^  0  denotes  signal  presence,  while  ug(t)  =_  0  denotes  signal  absence). 

As  noted  in  §  14.2,  optimum  detector  system  design  will  boil  down  in  essence 
to  seeking  the  optimum  boundary  between  osculating  regions  A  and  B  of  decisions 
"1"  and  ”0,"  respectively.  Algorithm  (21.68)  is  applicable  only  in  those  cases 
when  only  population  C  of  parameters  of  a  system  of  known  structure  is  subjected 
to  system  design.  Therefore,  the  recommendation  in  [132]  is  to  find  the  boundary 
between  regions  A  and  B  in  the  form 


;(y.C)*c,<r1(y)+  ...+***„  M~cr  9  (//)*=0,  (21.96) 

where  ^^(y),  .  .  .,  <pN(y)  —  known  (preselected)  linearly-independent  functions 
or,  in  the  more  general  case,  known  operators  (processing  algorithms)  of  realizations 
y(t). 


This  denotes  that  the  decision  rule  is  sought  in  the  form^  s  P (y,  C),  /416 
i.  e.,  has  the  following  form: 
if 

Crf<f»0,  (21.97) 

then  realization  y(t)  belongs  to  region  A  and,  consequently,  the  decision  taken 
must  be/'s  1; 

if  condition  (21.97)  is  not  satisfied,  i.  e., 

c'ffixo. 

then  realization  y(t)  falls  in  region  B  and  the  decision  taken  is  Ys  0* 


light  of  thia,  quality  index  (21.93)  nay  be  represented  in  the  following 


form 


P  (Q  *»  f«n(<5“P  (ttihC)  4-  to=  •)  i 


(21.98) 


where  P  ( )  denotes  that,  as  usual,  the  probability  of  the  occurence  of  the  event 
indicated  in  the  brackets. 


Actually,  it  follows  from  (21.94)  and  (21.95)  that  both  J  and  *f  may  receive 
only  values  1  and  0.  Therefore,  the  sum  (  Y  +  f )  may  equal  unity  only  in  one 
of  the  following  incompatible  cases:  ° 


a)  either  Y  s  1*  T0  8  0* 

b)  or  T=  0*  T„  8  1 


(21.99b) 


But,  it  follows  from  (21.95)  that  7Q  s  B  when  and  only  when  uc(t)  =  0, 
•hlle  Y0*  1  when  and  only  when  ue(t)  4  0,  Therefore,  conditions  (21.99a)  and 
(21.99b)  also  may  be  written  in  the  following  form 

a)  either  7*1,  u  (t>  a  0  (2L 


b)  or  Y  •  Q,  u_(t)  *  0 


(21.100a) 
(21.  UIQb) 


the  probability  of  the  first  of  these  events  equals  P(0)  pMT ,  while  the 
probability  of  the  second  event  equals  P(0Pv.  Therefore,  the  probability  of 
the  occurence  of  even  one  of  the  two  incompatible  events  equals 


Consequently,  quality  index  (21.98)  actually  completely  coincides  with  initial 


We  will  designate 


l(r.C)-T(r.C)-fT»- 


(21.101) 


i:.  '"t: 

|.V 


;it "  i J  l  y  ■ 

V'v-W 


I 


Then,  from  (21.98),  we  have 


7(Q=PU(y.  C)  =  1 ) . 


(21.102) 


In  order  to  reduce  this  expression  to  the  (21.63)  form,  additional  determinate 
function  6  such  that 


•«)-( 


l.  if  5  = 

0.  if  S*l. 


Then,  the  result  is 


(21.103) 


/(C)-AI{Q(6)). 


(21.104) 


Actually,  it  follows  from  (21.103)  that 


M{a<6)}  =  »Ptt=0+OPU=o}  =  P{5=i}. 


which  coincides  with  expression  (21.102)  for  7(0. 

So,  initial  expression  (21.93)  for  a  detector  quality  index  actually  /417 
may  be  represented  in  the  (21.104)  form  or,  considering  (21.101)  and  (21.103), 
in  the  form 


f  (Cj-M  (0(T(lf.cj+T»)). 


(21.105) 


This  expression  with  respect  to  its  structure  completely  coincides  with  expression 
(21.63).  Function  Q(  )  plays  the  role  of  function  Q  (  )  here.  Since  magnitude 

Q  may  have  only  the  values  1  and  0  [see  (21.103)],  then  function  Q  (  )  has 

no  derivative  with  respect  to  C.  Therefore,  as  noted  above,  approximate  relationship 
(21.72)  should  be  used  to  determine  gradient  VeQ()  ,  i.  e. ,  a  (21.76)-type  basic 
algorithm: 


11— iMflivoQ  (t  (if  1*1.  C l«—ll  )+•»•). 


(21.106) 


Here,  gradient  ^«Q(t  (h.  Q  +  ?»)  is  determined  from  relationships  (21.73)  and  (21.74), 


in  which  Q  should  be  replaced  by  Q  .  In  accordance  with  (21.70),  for  simplicity 
it  is  possible  to  assume  that 

A  («|  —  /  « {«). 

It  follows  from  (21.106)  that  ana  must  have  correct  (ideal)  decisions  If 
determined  from  relationships  (21.9$)  to  find  optimum  value  C„ p  of  vector^?. 
Evidently,  it  is  impossible  in  principle  to  obtain  such  decisions  in  the  operating 
mode  and  a  special  mode  is  required— the  pattern  mode.  The  detector  may  switch 
to  the  operating  mode  only  after  the  pattern  mode  ceases,  i.  e. ,  after  determination 
of  the  optimum  vector  C«p  value. 

In  light  of  the  above,  the  structural  diagram  of  a  detector  patterning  in 
accordance  with  algorithm  (21.106)  may  be  represented  in  the  form  depicted  in 


Figure  21.7.  (a)  —  OS  [signal  detector];  (b)  —  P  [switch]; 

(c)  —  G  [generator];  (d)  —  F  [filter];  (e)  —  DI  [discrete  integrator]; 

(f)  —  K  [switch] 

Figure  21.7.  The  circuit  section  enclosed  by  the  dotted  line  in  this  figure  /418 
has  the  same  purpose  and  composition  as  the  Figure  21.4  circuit. 


Unit  oc  —  signal  detector,  parameter  vector  C  of  which  is  optimized  during 
the  pattern  process.  This  unit's  structure  is  determined  by  decision  rule  (21.97) 
and  is  expended  in  Figure  21.8.  Here,  Bt, ....  computers,  each  of  which  computing 


Figure  21.8.  (a)  —  V  [computer];  (b)  —  PU  [threshold  device]. 

magnitudes  c£\(y)  and  multiplying  the  result  by  coefficient  (1  :  1,  .  .  ., 

N).  Computer  output  voltage  sum  u^  reaches  the  threshold  device,  which  supplies 
unity  (7=  1)  when  u^^^Xl  and  zero  (  7s  0)  when  <  0. 

We  now  will  return  to  the  Figure  21.7  circuit.  All  elements  in  this  circuit, 
except  detector  unit  oc,  may  be  considered  an  adjunct  to  the  detector,  required 
to  accomplish  detector  "training."  Switches  77,  and  nt  accomplish  the  switching 
from  the  pattern  mode  (position  1)  to  the  operating  mode  (position  2). 

In  the  pattern  mode,  pattern  realizations  yQ(t)  shaped  from  wit* 
and  signal  u^t)  reach  detector  OC  input.  Signal  voltage  ucQ(t)  is  connected 
via  switch  controlled  by  generator  F  .  This  same  generator  processes  pattern 
correct  decisions  fQ  synchronously  with  the  switch  closing:  when  the  switch 
is  closed,  7q  s  1  is  issued,  with  70  s  0  supplied  in  the  remaining  cases. 

The  pattern  comprises  a  series  of  cycles,  each  with  duration  T  equalling  the  duration 
of  the  time  that  will  be  allocated  to  detection  in  the  operating  mode.  In  each 
cycle,  signal  u^t)  is  either  cut  in  or  cut  out,  with  probabilities  P(C)  and 
P(0),  respectively.  Therefore,  generator  r  must  comprise  a  statistical  mechanism 
insuring  random  closing  and  opening  of  switch  (at  time  T  each  time)  in  accordance 
with  these  probabilities. 


The  circuit  portion  between  points  A  and  B  will  process  magnitude  I  •  of  function 

Q_(().  determined  by  formula  (21.103). 

During  the  pattern  process,  detector  parameters  c^,  .  .  . ,  change  with 

respect  to  time,  approximating  (when  t - »-oo)  constant  values  cnpt, ....  eKpfl  ,  which 

correspond  to  the  optimum  [in  the  sense  of  quality  indicator  (21.93)]  vector  C 
value  Ckp  .  Essentially,  patterning  may  be  considered  completed  if  parameters 
c^,  .  .  . ,  approximates  their  stationary  value  with  sufficient  accuracy  (with 
accuracy  no  worse  than  1%,  for  example).  Here,  Iti  and  i7,  will  convert  detector 
OC  to  the  operating  mode:  operating  realizations  y(t)  are  supplied  to  detector 
input,  while  decisions  f  about  signal  presence  ( f  =  1)  or  absence  (  f=  0)  are 
shaped  at  its  output. 

Evidently,  patterning  will  be  effective  only  given  signal  u£(t)  and  noise 
“■  (0  statistical  characteristics  in  the  operating  mode  are  identical  to  those 
in  the  pattern  mode. 

This  brief  examination  of  the  stochastic  approximation  method  makes  it  /419 
possible  to  draw  the  following  conclusions  about  this  method's  basic  advantages 
and  disadvantages. 

The  chief  advantage  is  that  this  method  makes  it  possible  relatively  simply 
to  design  receiving  systems  under  the  difficult  conditions  of  incomplete  a  priori 
information.  Here,  the  method  is  applicable  not  only  for  unknown  parameters  of 
additive  and  non-additive  noise  distributions,  but  also  for  completely  unknown 
types  of  these  distributions.  A  second  advantage  is  absence  of  often  unsubstantiated 
assumptions  that  an  unknown  a  priori  distribution  is  uniform. 

However,  there  are  serious  shortcomings  inherent  in  this  method  (just  as 
in  any  other). 

First,  there  is  the  requirement  to  provide  requisite  conditions  for  creating 
the  pattern  mode,  which  is  not  always  possible. 

Second,  and  more  important,  only  the  parameters  of  a  system  with  a  known 
structure,  rather  than  the  system  as  a  whole  (including  its  structure  as  well), 


are  optimized.  For  example,  in  example  2  above,  it  is  assumed  that  the  detector 
structure  has  the  form  depicted  in  Figure  21.8  and,  moreover,  the  specific  form 
of  functions  (or  operators)  .$^(y),  .  .  .,  ,$^(y)  is  given  since  only  factors 
c^,  .  .  . ,  c^j  will  be  subject  to  system  design.  [In  example  1,  this  shortcoming 
manifests  itself  to  a  lesser  degree  since  it  is  evident  that,  given  variations, 
applicable  for  practice,  in  the  number  of  parameters  c^,  .  .  .,  c^,  it  is  possible 
to  provide  essentially  any  type  of  synthesized  impulse  response  7|  (t)]. 

In  addition,  the  problem  of  optimum  selection  of  the  discrete  function  a[n] 
type  (including  the  magnitude  of  the  quantization  step)  or  of  its  continuous  analog 
a(t)  still  is  only  solved  partially.  Extant  materials  mainly  boil  down  to  indications 
of  the  necessary  conditions  which  function  a[n]  must  meet  in  order  to  provide 
pattern  process  convergence  [to  (21.59)-type  conditions,  for  instance].  But, 
the  type  of  this  function  significantly  impacts  also  on  the  pattern  process  duration 
(conversion  rate)  and  on  pattern  quality  overall.  Therefore,  it  is  very  important 
that  more-optimum  function  a[n]  [or  a(t)]  selection  methods  be  developed. 

The  problem  of  the  noise  immunity  of  the  pattern  process  also  has  not  been 
studied  sufficiently,  i.  e.,  the  problem  of  the  impact  of  noise  additional  in 
comparison  to  that  extant  in  the  operating  mode. 

Finally,  failure  to  make  optimum  use  of  extant  a  priori  information  is  a 
very  significant  drawback  to  the  stochastic  approximation  method.  For  instance, 
in  example  2  examined  ebove,  no  a  priori  (preceding  initiation  of  patterning) 
information  about  the  signal  and  noise  is  used  directly  when  organizing  the  pattern 
process  (only  a  priori  information  on  signal/no  signal  probabilities  is  considered). 
This  circumstance  is  an  advantage  of  the  method  if  such  a  priori  information  actually 
is  completely  lacking.  But,  this  becomes  a  liability  in  those  cases  when  a  priori 
information  is  only  partially,  rather  than  completely,  lacking.  Evidently,  /420 
when  even  some  partial  a  priori  information  is  present,  it  must  be  used  during 
selection  of  the  designed  system's  structure  (in  particular,  in  selection  of  functions 
(or  operators). $  ^(y),  .  .  . ,  $N(y)].  However,  at  present,  the  problem  of  using 
extant  a  priori  information  to  optimize  the  stochastic  approximation  process  still 
is  far  from  being  solved  completely. 


Thu 3,  the  theory  of  stochastic  approximation  still  is  in  the  initial  stage 
of  development.  However,  even  now  its  use  may  turn  out  to  be  helpful  in  solving 
several  practical  problems. 

21.5  Concluding  Comments 

The  examination  of  various  ways  of  designing  receiving  systems,  given  incomplete 
a  priori  information,  presented  in  this  chapter  demonstrates  that  all  these  methods 
still  are  in  the  initial  stage  of  development  and  each  has  signficant  advantages 
and  disadvantages.  In  the  future,  apparently,  there  will  be  succes  in  eliminating 
some  of  the  shortcomings  inherent  in  them.  This  may  be  achieved  both  throuc^i 
independent  development  of  each  method  and  by  their  joint  supplementation  and 
enrichment. 

However,  as  noted  in  the  beginning  of  this  chapter,  as  a  result  of  the  infinite 
variety  of  degrees  of  a  priori  information  incompleteness,  there  is  no  basis  upon 
which  to  assume  that,  even  in  the  future,  a  single  system  design  method  sufficient 
for  all  situations  will  be  developed.  Therefore,  under  complicated  conditions, 
to  increase  reliability,  it  may  be  useful  to  undertake  system  design,  not  in 
accordance  with  one,  but  using  at  least  two  methods  and  then  comparing  the  results 
obtained. 


CHAPTER  TWENTY-TWO 
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SPECIAL  FEATURES  OF  RADIO  RECEIVING  SYSTEM  DESIGN  BASED  ON  SEVERAL  QUALITY  INDICES 
22.1  System  Design  Problem  Formulation  Based  On  Several  Quality  Indices 

Radio  receiving  device  quality  is  estimated  by  several  indices  characterizing 
message  fidelity  (authenticity),  operating  reliability,  cost,  weight,  overall 
dimensions,  and  so  on.  Message  fidelity  (authenticity),  in  turn,  often  must  be 
estimated  not  by  one,  but  by  several  indices  such  as  false-alarm  and  miss  /421 
probability  (for  binary  detection),  root-mean-square  error  probability,  anomalous 
error  probability  (during  analog  message  reception),  and  so  forth.  Therefore, 
in  general  form,  system  design  of  a  radio  receiving  device,  just  like  any  other, 
must  be  formulated  in  the  following  manner. 

System  operating  conditions  y  ,  limitations  TTplaced  on  it,  and  quality 
index  K  are  given.  Here,  the  arrows  underscore  that,  in  the  general  case,  there 
may  be  several  conditions,  limitations,  and  indices;  for  instance,  vector"?  designates 
the  population  of  all  quality  indices  which  must  be  considered  during  system  design: 

*-{*..*. . ».}■  (“-D 

Message,  signal,  and  noise  characteristics,  in  particular,  fall  in  the  category 
of  conditions  y  .  Limitations  If  may  comprise  requirements  for  a  single-channel 
designed  system,  its  linearity,  and  so  on. 
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The  system  design  problem  is  to  find  that  system  S  which,  for  given  conditions 
y  and  limitations  Tf,  provides  the  best  quality  index  population-^  value. 

Depending  on  the  type  of  each  elementary  quality  index  k.  (i  :  1,  .  .  . ,  m), 
its  smallest  or  largest  value  is  best.  For  instance,  a  system  is  better,  the 
lower  error  probabilities  P»  rand  PBV  and  the  higher  the  probability  P9  of  no 
failure  (in  the  technological  reliability  sense).  However,  if  the  best  value 
of  some  quality  index  k^  is  its  highest  value,  then  it  always  is  possible  to  reduce 
equivalent  index  k'^  so  that  its  smallest  value  is  best.  For  example,  instead 
of  correct  detection  Pa ,  and  no  failure  PM  probabilities,  it  is  possible  to  introduce 
error  probability  Pom  **  1  —  •  and  no  failure  probability  P*  **  I  —  Pm  . 

Thus,  in  future,  it  is  possible  without  disrupting  generality  to  assume  that 
all  quality  indices  k^,  .  .  .,  k^  are  positive  and  their  smallest  values  are  best. 

Here,  the  best  system  would  be  one  providing  the  minimum  of  all  quality  indices 
k^,  .  .  .,  k^.  However,  in  the  general  (non degenerative)  case,  it  is  impossible 
to  select  a  system  immediately  providing  the  minimum  of  two  or  more  quality  indices. 
Thus,  no  single  standard  system  exists  making  it  possible  to  choose  the  best  system 
if  a  (22.1)-type  vector  quality  index  characterizes  its  quality.  It  is  possible 
only  to  indicate  several  different  ways  to  solve  such  a  problem,  each  having  its 
advantages,  disadvantages,  and  applicability. 

In  future,  we  will  use  the  term  vector  for  system  design  based  on  population 
(22.1)  of  several  quality  indices,  while  we  will  use  the  term  scalar  for  system 
design  based  on  a  single  (one)  index.  In  radio  electronics,  methods  based  on 
some  sort  of  reduction  of  vector  system  design  into  scalar  are  the  most  widespread. 

The  first  method  is  based  on  replacement  of  quality  index  population  /422 
(22.1)  with  some  resultant  (combined)  quality  index  k9  ,  which  is  a  known  function 

kp  ■» f  (ku  kv  (22.2) 

of  elementary  indices  k^,  k^,  .  .  *t  km  and  design  of  a  system  based  on  this  indi¬ 
cator. 


604 


The  second  method  is  bssed  on  conversion  of  all  quality  indices  except  one, 
celled  the  main  index,  into  a  series  of  limitations.  Then  the  problem  will  boil 
down  to  system  design  based  on  single  quality  index  k^,  given  limitations'?’'  which 
includes,  along  with  initial  limitations-?,  limitations  of  the  following  type 
of  equalities  as  well 


kt  km  =  knt 


or  inequalities 


(22.3) 


(22.4) 


or  of  mixed  limitations. 

The  third  method  differs  from  the  second  only  in  that  some  indices  kg, 

.  .  . ,  kffl  (or  all  indices)  during  system  design  are  not  considered  at  all  or 
are  considered  only  qualitatively  (for  example,  during  mathematical  system  design, 
system  cost  usually  is  not  considered  at  all  or  is  considered  only  to  the  extent 
that  the  designed  system  is  sought  in  the  class  of  single-channel  or  linear  systems) 

The  first  method  is  the  most  natural  if  the  designed  system  (device)  is  pert 
of  a  more-complex  system  characterized  by  quality  index  k,  .  In  this  case,  indices 
k^,  .  .  . ,  km  may  be  considered  parameters  of  the  more-complex  system.  However, 
the  first  method  has  the  following  significant  drawbacks: 

1.  The  designed  system  may  be  so  autonomous  (radio  measuring  instrument, 
for  example)  that  it  cannot  be  considered  part  of  a  more-complex  system  and  there 
is  sufficient  substantiation  to  introduce  some  resultant  quality  index  k9. 

2.  A  more-complex  system  in  which  the  given  system  is  included  often  is 

characterized,  not  by  one,  but  by  several  quality  indices  kpl,  kvt . ,  which 

are  dependent  on  indices  k^,  kg,  .  .  .,  kffl.  In  this  case,  it  usually  is  impossible 
with  sufficient  substantiation  to  replace  index  population  k^,  .  .  .,  k^  by  a 
single  resultant  index. 


3.  Even  in  those  cases  when  the  designed  system  is  port  of  a  more-complex 


system  characterized  by  unitary  quality  index  kp  ,  it  may  be  difficult  or  impossible 
to  use  the  first  method  for  the  following  reasons: 

a)  if  the  more-complex  system  is  developed  approximately  at  the  same  time 
as  the  given  system,  then  a  (22.2)-type  functional  dependence  usually  is  unknown; 

b)  even  if  the  (22.2)-type  dependence  is  known,  design  of  the  given  /423 
system  (subsystem)  based  on  combined  criterion  (22.2),  being  possible  in  principle, 
may  tum  out  to  be  essentially  very  complex. 

Thus,  the  first  method,  in  a  number  of  cases  more  natural  and  advisible, 
is  far  from  being  standard. 

The  second  method  has  the  following  basic  shortcomings: 

1.  In  a  majority  of  cases,  it  is  impossible  with  sufficient  basis  to  consider 
one  quality  index  to  be  the  main  one  and  the  others  secondary.  In  addition,  in 
the  general  case,  it  is  impossible  to  be  sure  that  selection  of  one  index  rather 
than  any  other  as  the  main  index  will  provide  best  system  design  results. 

2.  For  quality  indices  k^,  .  .  .,  kffl  converted  into  a  series  of  limitations, 
it  usually  is  impossible  unequivocally  to  establish  their  permissible  values  k2(j, 

.  .  .,  k^  and  it  is  impossible  to  be  sure  that  just  this  specific  combination 
of  these  values  will  provide  the  best  system  design  results. 

The  third  method  still  to  a  great  degree  is  more  random  than  the  second  since 
it  requires  not  only  conversion  of  a  series  of  quality  indices  into  a  series  of 
limitations,  but  complete  disregard  (during  the  system  design  process)  of  one 
or  several  indicators. 

Thus,  sufficiently-substantiated  reduction  of  vector  system  design  into  scalar 
system  design  often  is  impossible,  especially  in  the  initial  system  project  planning 
stage.  Here,  it  is  necessary  during  the  system  design  process  to  compare  a  system 
simultaneously  with  respect  to  all  quality  index  populations,  i.  e. ,  with  respect 
to  vectorlT  components. 

We  will  assume  that,  in  an  m-dimensional  quality  index  space  (Figure  22.1), 
one  and  only  one  vector  T?  value  corresponds  to  each  system  S  and,  conversely, 
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Figure  22.1 

only  one  system  S*  corresponds  to  each  point  a  of  this  space.  When  comparing 
quality  indices 

K  k§t  •••• 

*; . o 


we  will  use  the  following  designations: 


1.  K'  Si  K  when  and  only  when  /4I 

V<*»  (/-l . m).  (2 

2.  If'  <.  iTwhen  and  only  when  for  *11  values  of  the  number  if 

with  the  exception  at  least  of  one  number*  for  which 

kf  <  kf.  f  n. 


3.  <  "iT  when  and  only  when 


*,'<*«  (f-1 . m).  ^ 

♦If  one  and  the  same  quality  index  may  have  an  entire  class  of  systems*  then 
S  should  be  understood  to  mean  this  class. 


..‘<V 


We  will  call  system  S  (K' )  better  than  system  S(K)  when  and  only  when  even 

i 

one  of  the  system  S  elementary  quality  indices  is  less  then  and  the  remaining 
indices  not  greater  than  the  corresponding  system  S  indices.  Mathematically, 
this  preference  ratio  may  be  written  in  the  following  form: 


System  S'(K')  is  better  than  S(K)  when  and  only  when 


(22.9) 


This  preference  ratio  is  called  unconditional  since  its  validity  is  unconditional, 
i.  e.,  it  causes  no  doubts  and  does  not  require  introduction  of  any  additional 
conditions  and  assumptions  for  substantiation. 

Evidently,  the  following  formulation  of  this  unconditional  preference  ratio 
completely  is  equivalent  to  the  preceding  formulation: 

System  S' (K'  )  is  worse  than  system  S(K)  when  and  only  when  even  one  of  the 
« 

system  S  elementary  indices  is  greater  and  the  rest  are  less  than  those  in  system 

s(i3. 

In  other  words,  system  S'  (K' )  is  worse  that  system  S(l$  when  and  only  when 


K’>K. 


(22.10) 


On  the  basis  of  preference  ratio  (22.9)  [or  (22.10)],  set  Af,  of  all  possible* 
systems  (Figure  22.1)  is  divided  into  two  nonintersecting  classes:  set  M%  of 
worse  systems  and  set  M„of  non-worse  systems.  Point  (system)  a  6  Af,,  with  quality 
index  K,  by  definition  falls  into  the  class  of  worse  if  there  is  even  one  (other) 
point  (system)  having  quality  index  K'  ^  K  in  set  Af,  .  Correspondingly,  point 
(system)  a  6  Af,  falls  into  the  non-worse  class  if  no  one  point  (system),  better 
in  comparison  with  it,  exists  in  region  Aft,  i.  e.,  there  is  no  point  (system) 
o'  €  Af  .where’ll  <  T. 


•Here  and  in  future,  the  term  possible  is  used  for  systems  satisfying  given 
conditions  7  and  limitations  TT. 


■ 


v.-..V7r.’7.V.‘7.'. 


It  follows  from  these  determinations  of  worse  and  non-worse  points  (systems) 
that  any  set  Af,  point  (system)  must  be  either  worse  or  non-worse,  i.  e. ,  /425 

belong  to  set  M*  or  Aful . 

Actually,  we  will  take  random  point  a  from  set  Af,  having  some  quality  index 
Then,  evidently,  only  the  following  two  mutuallv-exclusive  situations  are 
possible: 

1.  In  set  Af.  ,  there  is  just  one  point  a'  having  quality  index  1?  <  "iT. 

2.  In  set  Af,  there  is  no  single  point  a'  having  index-?'  <  K. 

But,  if  situation  1  occurs,  then  point  a  is,  by  definition,  worse.  If, 
conversely,  situation  2  occurs,  then,  by  definition,  point  a  must  be  considered 
non-worse.  Thus,  actually  any  set Af,  point  (system)  must  be  either  worse  or  non-worse. 

Since  the  object  of  the  system  design  is  finding  the  best  point  (system), 
then  all  worse  points  (systems)  must  be  discarded  and,  at  the  next  system  design 
stage,  only  non-worse  paints  considered  (examined). 

If  we  will  compare  among  themselves  any  two  non-worse  points  a(K)  and  a'(K'), 
then  it  is  impossible  to  designate  either  one  of  them  unconditionally  better  or 
worse  than  the  other  [i.  e. ,  better  or  worse  within  the  bounds  of  preference  ratio 
(22.9)  or  (22.10)].  Actually,  we  will  assume  that  we  recognized  a'(K')  as  better 
then  a(K).  This  denotes  that  a  point  better  than  a  exists  in  set  Af,  .  Hence, 
it  follows  that  point  a  belongs  to  worse  points,  and  not  to  non-worse  points, 
which  contradicts  the  initial  assertion  that  point  a  is  a  non-worse  point. 

Analogously,  it  also  demonstrates  that  non-worse  point  a*  may  not  be  called 
worse  than  another  non-worse  point  a.  This  supposition  may  be  explained  also 
as  follows. 

When  comparing  non-worse  points  a'(K')  and  a(K),  in  accordance  with  their 
determination,  it  is  mandatory  that,  if  a  lesser  value  of  even  one  of  the  quality 
indices  corresponds  to  one  of  the  points,  then  a  higher  value  of  any  other  of 


m  indices  will  correspond  to  this  same  point.  Therefore,  no  unconditional  preference 
may  be  given  to  one  non-worse  point  compared  to  another. 

Hence,  it  follows  that,  in  order  to  be  able  to  select  any  one  point  (system) 
from  the  population  of  non-worse  points  (systems),  some  additional  condition  (or 
population  of  conditions)  must  be  introduced  to  supplement  the  unconditional  pre¬ 
ference  ratio.  Thus,  remaining  within  the  bounds  of  the  unconditional  preference 
ratio,  the  solution  to  the  system  design  problem  may  be  reduced  only  to  finding 
population  (set)  MHX  of  non-worse  points  (systems). 

In  a  nondegenerative  case,  when  set  comprises  just  one  point,  this  point 
(system)  is  best  (since  all  set  Mt  points  are  worse).  Consequently,  the  non-worse 
point  is  best  when  and  only  when  set  Mmx  is  nondegenerative. 

We  will  designate  the  non-worse  system  S  and  the  vector  quality  index  /426 
corresponding  to  it 

. *J.  (22.11) 

Then,  its  following  property  flows  from  the  determination  of  the  non-worse 
point  (system)  given  above. 

Let  selected  derivatives  (m  —  1)  of  fundamental  indices,  k^,  kj,  k^, 

.  .  . ,  km  for  example,  equal  values  k^,  fcj,  k^,  .  .  •»  km»  respectively.  Then, 
it  is  possible  to  assert  that  there  is  no  other  point  (system)  in  set  Mx  which, 
given  such  index  k^  k3»  •  •  •»  km  values,  would  have  the  index  k2  value,  which 
did  not  remain  fixed  identical  to  or  lesa  than  k2* 

A 

Actually,  if  another  point  would  have  kj  *  k 2,  then  this  would  denote  that 
all  m  quality  indices  of  another  point  coincide  with  the  corresponding  indices 
of  a  given  non-worse  point.  But,  this  would  signify  that  "another"  point  in  actuality 
coincides  with  the  given  point.  If  you  assume  that,  given  coinciding  (m  —  1) 
indices,  another  point  has  a  lesser  value  of  the  m-th  index  (index  k2  in  this 
case),  then  this  denotes  that  another  point  is  better  than  the  given  point.  But, 
this  is  impossible  since,  based  on  the  condition,  the  given  point  is  non-worse 
and,  consequently,  no  better  point  than  it  exist''. 


For  example,  consider  that  m  =  3  and  we  found  non-worse  point  a 
(Figure  2.22).  Then,  the  following  assertions  are  valid: 


A 


A  A 

1.  For  the  givens  (k,,  k.),  no  point  (system)  which  would  have  an  index 

kj  value  less  than  or  equal  to  k^  exists  in  space  Af, .  In  other  words,  the  ordinate 
of  point  a  indicates  the  minimum-possible  index  kj  value  which  may  be  achieved 
for  given  conditions  y  ,  limitations  0,  and  fixed  values  (k^,  k^)  of  the  remaining 
quality  indices. 

A  A  A 

2.  For  givens  k2,  k^,  the  minimum-possible  index  k^  value  equals  k^. 

A  A  A 

3.  For  the  givens  (k^,  k^),  the  minimum-possible  index  k^  value  equals  k^. 

Evidently,  in  the  case  of  m  quality  indices,  it  is  possible  to  make  m  analogous 
assertions. 

It  follows  from  the  aforementioned  non-worse  point  property  that  a  minimum- 
possible  (potential)  value  of  any  of  the  quality  indices,  given  fixed  values  for 
all  remaining  (m  —  1)  indices,  corresponds  to  each  such  point.  Consequently, 
if  one  succeeds  in  finding  setAfw«of  non-worse  points  (systems),  then  this  /427 
denotes  that  one  may  succed  in  finding  m  dependences  of  the  type 

Jt  «m  — fl  (^*»  •••»  ^m)* 

*1  MU  ••••  ^m)» 


(22.12) 


where  values  •••»  Hr*  may  be  called  particular  potential  values  of 

quality  indices  k^,  k^,  .  .  . ,  (as  opposed  to  a  single,  global,  potential  value 
which  will  be  found  in  a  case  of  system  design  with  respect  to  a  single  index). 

The  following  basic  properties  of  non-worse  points  may  be  formulated  on  the 
basis  of  what  has  been  presented: 

Property  1.  All  set  Af,  points  (systems)  that  are  non-worse  are  worse  (in 
the  sense  of  vector  quality  index"?). 

Property  2.  No  non-worse  point  (system)  may  be  recognized  as  better  or  worse 
than  another  non-worse  point  (system). 

Property  3.  If  only  one  non-worse  point  (system)  exists,  then  it  is  the 

best. 

Property  4.  A  minimum  (best-possible,  potential)  value  of  any  quality  index 
k.  (i  :  1,  .  .  . ,  m)  corresponds  to  each  non-worse  point  (system),  given  fixed 
values  of  all  remaining  (m  —  1)  quality  indices. 

Several  additional  important  properties  of  non-worse  points,  methods  of  finding 
these  points,  and  several  examples  are  examined  in  the  following  sections.  Here, 
further  examination  is  limited  to  cases  of  two  and  three  quality  indices  in  accordance 
with  the  following  considerations: 

1.  Cases  of  mathematical  system  design  with  respect  to  two  or  three  quality 
indices  are  encountered  most  often  in  practical  problems. 

2.  When  more  than  three  quality  indices  are  involved,  system  design  results 
using  the  unconditional  preference  method  are  considerably  less  clear  and,  in 
particular,  do  not  allow  clear  geometric  interpretation. 

22.2  Two  Quality  Indices 


In  the  case  of  two  quality  indices  (m  =  2),  general  expressions  (22.12)  take 


the  following  form: 


m«b  **/i  (^»)»  (22.13a) 

i|«.s/i(4  (22.13b) 

The  set  of  non-worse  points  must  satisfy  both  these  relationships,  i.  e.,  must 
possess  the  following  properties: 

1.  For  each  index  l<2  value,  value  k^  must  be  the  minimum-possible  /428 
value,  i.  e,  for  a  given  k2»  no  points  (systems)  less  than  k^  must  exist. 

2.  For  each  index  k^  value,  value  k2  must  be  minimum,  i.  e. ,  for  a  given 
k^,  no  points  less  than  k2  must  exist. 

It  is  easy  to  demonstrate  that  non-worse  points  may  be  those  and  only  those 
points  which,  on  quality  index  plane  k^  0  k2  (Figure  22.3),  are  located  on  a  monotone 


Figure  22.3  Figure  22.4 


nonincreasing  curve,  both  when  k^  rises  and  when  k2  rises. 

Consequently,  in  a  case  of  two  quality  indices,  the  set  of  non-worse  points 
has  the  form  of  a  monotone  nonincreasing  curve,  both  when  k^  rises  and  when  k2 
rises.  It  is  not  mandatory  that  this  curve  be  continuous  (for  example,  see  Figure 
22.4)  and,  in  several  cases,  it  may  comprise  only  several  points  (Figure  22.5). 
However,  it  always  must  meet  the  aforementioned  condition  of  strict  monotonicity. 


The  only  exception  is  a  nondegene rat ive  case  in  which  set  Af*,  comprises  just  one 
point.  In  future,  we  will  restrict  ourselves  to  examination  only  of  those  cases 
when  a  set  of  non-worse  points  forms  a  continuous  curve. 


Figure  22.5  Figure  22.6 

It  follows  also  from  determination  of  non-worse  points  (systems)  that,  for 
given  conditions  y  and  limitations  0,  no  single  point  (system)  located  to  the 
left  or  below  the  given  curve  exists  (the  validity  of  this  assertion  is  demonstrated 
easily  from  the  converse).  Therefore,  setAfnof  non-worse  points  (systems)  is 
the  lower  left  boundary  of  region  Aft  of  all  possible  points  (systems). 

Thus,  in  a  case  of  two  quality  indices,  it  is  possible  to  add  the  following 
properties  to  the  properties  of  non-worse  points  indicated  on  page  613: 

Property  5.  Set  AfM  of  non-worse  points  forms  the  lower  left  boundary  of 
region  M*  of  all  possible  points. 

Property  6.  The  lower  left  boundary  is  a  monotone  nonincreasing  curve,  both 
when  k^  rises  and  when  k2  rises,  i.  e.,  it  has  the  property  of  strict  monotonicity. 

It  follows  from  the  aforementioned  that  system  design  with  respect  to  /429 
two  qiBlity  indices  in  essence  will  boil  down  to  finding  the  lower  left  boundary 
providing  optimum  (best-possible)  link 


between  both  quality  indices.  There  are  different  methods  for  finding  the  lower 
left  boundary.  We  will  examine  only  the  two  most-widespread  approaches. 


The  first  method  is  that  the  minimum  of  one  of  the  quality  indices  is  soucjit, 
given  a  fixed,  but  random,  value  of  the  other: 

k^  s  min,  where  k^  =  const,  (22.15a) 


or 


k^  -  min,  where  =  const. 


(22.15b) 


Such  minimization  is  accomplished  for  all  possible  (or  of  interest)  values  of 
the  fixed  (i.  e. ,  converted  into  a  series  of  limitations)  quality  index.  Here, 
if  indicator  k^  is  minimized,  then  the  result  is  a  dependence  of  the  type 


MM  ~f  pl  (^l)* 


(22.16a) 


If  k^  is  minimized,  then  the  result  is 

mm  “/p«(^*)*  (22.16b) 

Ue  will  call  dependences  (22.16a)  and  (22.16b)  type  I  and  II  performance  curves, 
respectively. 

It  follows  from  comparison  of  expressions  (22.13a),  (22.13b),  and  22.16a) 
that  a  type  I  performance  curve  satisfies  one  of  the  two  conditions  placed  on 
non-worse  points  (condition  (22.13b),  but  may  not  satisfy  the  second  (condition 
22.13a).  The  following  important  type  I  performance  curve  property  emerges: 

The  performance  curve  will  comprise  all  non-worse  points,  but  may  comprise 
several  others,  i.  e.,  worse  points,  in  addition. 

For  example,  performance  curve  aQ  a^  a^'  depicted  in  Figure  22.6  will  comprise 
section  a^  a^'  consisting  of  a  population  of  worse  points  in  addition  to  lower 
left  boundary  a  a.  . 


A  type  II  performance  curve  also  has  an  analogous  property 


/430 


Consequently,  each  of  the  performance  curves  (22.16a)  and  (22.16b)  will  comprise 
all  lower  left  boundary  points  and,  in  addition,  in  several  cases  also  may  comprise 
a  series  of  additional,  worse,  points.  These  additional  points  are  easy  to  eliminate 
since  they  are  located  in  sectors  where  the  monotone  nonincreasing  nature  of  the 
k^  dependence  on  \<2  (or  on  k^)  is  disrupted.  It  may  be  demonstrated  that  the 
performance  curve  completely  coincides  with  the  lower  left  bouncbry  when  and  only 
when  this  performance  curve  has  a  monotone  nonincreasing  nature  in  the  entire 
range  of  possible  (or  of  interest  to  us)  quality  index  values. 

These  special  performance  curve  features  make  it  possible  to  draw  the  following 
conclusions  important  in  practice: 

1.  Conversion  of  any  of  the  quality  indices  (k^  or  k2)  into  a  series  of 
limitations  does  not  hinder  obtaining  optimum  system  design  results  if,  during 
such  a  conversion,  the  minimum  of  one  of  the  indices  (k^,  for  instance)  is  souc^it 
for  all  possible  (or  of  interest  to  us)  fixed  values  of  the  other  index  (k2»  for 
example) . 

2.  Elimination  of  additional  systems  (worse  points)  in  any  of  the  performance 
curves  suffices  for  finding  the  lower  left  bouncfery,  i.  e.,  the  population  of 
non-worse  systems. 

3.  If  the  performance  curve  is  a  monotone  nonincreasing  function  (in  the 
entire  range  of  the  values  of  its  argument  of  interest  to  us),  then  it  completely 
coincides  with  the  lower  left  boundary  and,  consequently,  there  is  no  requirement 
for  elimination. 

4.  Each  lower  left  boundary  point  has  the  six  important  properties  indicated 
on  pages  612  and  614,  particularly  property  4,  which  is  very  important  in  practice. 

3.  If  a  performance  curve  [characteristic  *» (Aj),  for  example]  is  a 
monotone  nonincreasing  function,  then  it  provides  best  results,  not  only  in  the 
sense  of  that  quality  index  was  minimized  (index  k2,  for  instance),  but  also  in 
the  sense  of  the  second  qimlity  index  (converted  into  a  series  of  limitations). 
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It  should  be  noted  that,  in  practical  problems,  a  performance  curve  has  a 
monotone  nonincreasing  nature  in  a  majority  of  cases.  This  is  because  an  increase 
in  one  quality  index  denotes  a  decrease  in  requirements  levied  on  the  system  relative 
to  this  index  and,  therefore,  makes  it  possible,  as  a  rule,  to  realize  the  best 
(lesser)  value  of  another  index.  Only  in  relatively-rare  cases  may  the  population 
of  limitations  0  be  such  that  its  strict  monotonicity  is  disrupted  in  individual 
performance  curve  sections.  Therefore,  one  may  consider  the  monotone  nonincreasing 
nature  of  a  performance  curve  a  rule  that  is  disrupted  only  in  rare  cases. 

We  will  examine  as  our  illustration  use  of  the  method  examined,  which  we 
will  call  the  performance  curve  method,  for  design  of  a  binary  signal  detector 
system  in  accordance  with  the  minimum  composite  error  probability  Pom  criterion.  /431 
This,  in  essence,  was  the  system  design  problem  formulation  presented  in  preceding 
chapters  (Chapters  5,  9,  and  14): 

find  the  structure  of  a  detector  providing  minimum  composite  probability 
Pom  of  detection  error  for  a  given  (fixed)  sigpal-to- noise  ratio  q  value. 

The  detector  is  this  problem  is  characterized  by  two  quality  indices,  Pom 
and  q;  the  lower  each  of  these  indices,  the  better  the  detector.  Consequently, 
in  this  case,  it  is  possible  to  assume 


(22.17) 


System  design  is  minimization  of  one  of  these  indices  (k^)  for  a  fixed,  but 
random,  value  of  the  second  (k^),  i.  e.,  for  the  performance  curve  method  presented 
above.  As  a  result  of  the  system  design,  a  type  II  performance  curve  was  obtained: 


When  the  signal  fluctuates  slowly  and  P(0)  =  P(C)  =  0.5,  it  has  the  form 
depicted  in  Figure  9.8  (here,  ? -  dep/tf.  ).  Since  this  characteristic  has  a 
monotone  nonincreasing  nature,  then,  by  virtue  of  the  aforementioned,  it  coincides 
with  the  lower  left  boundary  and,  consequently,  is  the  geometric  location  of  all 
nor>-worse  points  (systems).  In  this  case,  all  non-worse  systems  (points)  differ 
only  in  threshold  UQ  magnitude. 


Since  the  Figure  9.8  characteristic  coincides  with  the  lower  left  boundary, 
it  has  all  its  inherent  properties  (see  pages  612  and  614)  and,  in  particular, 
this  property  important  in  practice:  for  every  q  value,  it  provides  the  minimum 
(with  respect  to  all  possible  detectors)  magnitude  of  composite  error  probability 
Pam  and,  for  each  given  value  of  magnitude  P<, a  —the  minimum  (with  respect  to 
all  possible  detectors)  value  of  the  requisite  signal-to-noise  ratio  q.  In  other 
words,  if  we  would  design  a  detector  system,  not  in  accordance  with  criterion 
Pan  =  min  (where  q  =  const),  but  in  accordance  with  the  criterion  q  =  min  (where 
Pom  =  const)  or  we  attempted  to  consider  both  quality  indices  (  Pom  and  q)  im¬ 
mediately  during  the  design  process,  then  we  would  not  be  able  to  obtain  a  detector 
with  the  best  ratio  between  Poo  and  q,  with  a  lesser  q  value  where  Pom  =  const 
in  particular. 

Ue  now  will  examine  a  second  method  of  finding  the  lower  left  boundary. 

It  involves  finding  the  population  of  systems  providing  minimum  quality  index 
linear  combination 

k«*-kt+ckt  (22.18) 

given  all  possible  positive  finite  values  of  weight  factor  c  fixed  during  the 
minimization  process,  i.  e.,  given 

0<c<oo.  (22.19) 

Ue  will  use  k^c  and  l<2c  to  designate  the  values  of  quality  indices  k^  and 
corresponding  to  that  system  Sc,  which  provides  the  magnitude  km  minimum  for 
a  given  weight  factor  c  value.  Varying  value  c  within  the  (22.19)  range,  we  obtain 
dependences  of  the  type 


*1.  (c)» 

(22.20a) 

4t|,  "fi(c). 

(22.20b) 

Eliminating  magnitude  c  from  these  relationships,  we  will  obtain 


We  will  call  this  dependence  the  weight  characteristic.  As  analysis  demonstrates, 
a  weight  characteristic  has  the  following  properties: 

1.  It  either  completely  coincides  with  the  lower  left  boundary  or  is  /432 
a  part  of  it. 

2.  A  sufficient  condition  for  the  coincidence*  of  the  weight  characteristic 
with  the  lower  left  boundary  is  coherence  of  the  characteristic  (in  the  entire 
range  of  possible  quality  indices  or  of  those  of  interest  to  us). 

Here  and  in  future,  the  weight  characteristic  is  called  linked  if  it  is  determined 
on  a  linked  set  of  points.  (A  point  set  is  called  coherent  if  it  cannot  be  repre¬ 
sented  in  the  form  of  a  union  of  two  disjoint  sets,  each  of  which  not  comprising 
threshold  points  of  the  other). 

3.  A  condition  required  for  coincidence*  of  the  weight  characteristic  with 
the  lower  left  boundary  is  convexity**  of  the  lower  left  boundary. 

Important  consequences  flow  from  these  properties: 

1.  A  weight  characteristic  may  not  comprise  all  non-worse  points  (systems), 
but  it  must  have  one  worse  point  (system). 

2.  If  a  weight  characteristic  is  coherent  (in  the  entire  range  of  possible 
quality  index  values  or  of  those  of  interest  to  us),  then  it  will  comprise  all 
non-worse  points  (systems)  rather  then  one  worse  point  (system),  i.  e.,  it  completely 
will  solve  the  problem  of  system  design  based  on  two  quality  indices. 

As  analysis  shows,  the  sufficient  condition  of  coincidence  of  the  weight 
characteristic  with  the  lower  left  boundary  is  met  in  a  majority  of  practical 
problems  (but  not  in  all  problems) .  Therefore,  a  system  design  approach  based 
on  minimization  of  the  sum  (22.18)  usually  is  effective. 

*Preciee  to  extreme  points  usually  noexistent  in  practice. 

**The  line  (curve,  function)  specified  in  segment  x  is  called  convex  (downward) 
if  all  internal  points  of  any  arc  of  its  graph  (in  region  x)  are  located  beneath 
the  chord  linking  the  ends  of  the  arc  or  on  this  arc. 


We  will  examine  the  following  case  as  our  illustration. 


u 


Consider  design  of  a  binary  detector  system  for  a  slowly-fluctuating  signal 
on  a  background  of  white  noise  based  on  two  quality  indices 


and 


(22.22) 


where  Pup  and  p*t  —  conditional  signal  miss  and  false-alarm  probabilities 
( signal-to-noise  ratio  q  at  detector  input  is  assumed  to  be  given).  A  priori 
probabilities  of  signal  presence  and  absence  and  detection  error  weight  are  unknown 
so  there  is  no  way  to  form  a  single  combined  quality  index. 

The  requirement  is  to  find  the  population  of  non-worse  systems  and  the  potential 
(best- possible)  relationship  between  quality  indices  k^  and  k^  corresponding  to 
this  population. 

We  will  solve  this  problem  through  minimization  of  expression  (22.18),  which 
in  this  case  has  the  form 


**TpBp+ePjtr. 


(22.23) 


Solution  of  this  problem  provides  the  following  results  (see  Chapters  9  and  14). 

The  optimum  system  will  comprise  matched  linear  filter,  inertialess  /433 

envelope  detector,  and  output  threshold  bias.  Given  a  linear  detector  with  a 
unitary  transfer  constant,  threshold  magnitude  equals 


t/.e-VSv.  j/  (i  +  -J-)ln[c  <!+*)) 


(22.24) 


(one  should  assume  UQC  =  0  where  *  <  )  and  is  the  single  system  parameter 

dependent  on  weight  factor  c. 


Error  probabilities,  for  the  given  weight  factor  c  value,  equal 
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(22.25) 


where  q  —  signal-to-noise  power  ratio  at  system  input;  Um  —  envelope  detector 
output  noise  voltage  mean-square  value. 


It  follows  from  (22.25) 


^■P«"  l~~Part  •  (22.26) 

It  is  easy  to  become  convinced  that  this  function  is  linked  in  the  entire  range 
of  quality  index  values  of  interest  to  us  (Pup  <  1.  Pat  <  i)  .  Consequently, 
it  completely  coincides  with  the  lower  left  boundary  for  any  q  >  0.  (Where  q  = 
s  0,  the  examination  loses  meaning,  both  from  a  physical  and  a  mathematical  point 
of  view  since,  here,  in  accordance  with  (22.24),  the  result  is  U  ^  -*•  ,  i.  e., 

the  parameter  of  the  non-worse  system  determining  its  characteristic  will  become 
indeterminate) . 

It  also  is  easy  to  become  convinced  that  the  lower  left  boundary  found  in 
this  case  actually  is  strictly  monotonic— magnitude  Pa pt  monotonically  decreases 
when  probability  Pa*«  rises. 

Thus,  in  this  example  of  binary  detector  system  design,  use  of  the  method 

of  weighted  sum  minimization  (22.18)  made  it  possible  to  find  the  population  of 

non-worse  systems.  In  this  case,  one  non- <orse  system  differs  from  another  only 

in  the  threshold  bias  U  value,  which  may  be  determined  from  formula  (22.4)  or, 

oc 

considering  (22.25),  from  the  formula 


The  link  between  the  quality  indices  of  the  populationof  non-worse  systems 
found  is  expressed  by  relationship  (22.26).  Here,  in  accordance  with  the  afore¬ 
mentioned  non-worse  system  property  4,  the  minimunv-possible  value  of  magnitude 
P»r  for  given  magnitude  P„p  and  the  minimum  value  of  probability  P0 p  for  the 
given  probability  P4T  value  corresponds  to  each  such  system  (i.  e. ,  to  each  threshold 

U  value,  in  particular), 
oc 

In  conclusion,  we  will  note  the  following  comparative  special  features  of 
the  methods  examined  for  finding  non-w or se  systems. 

Methods  based  on  finding  the  performance  curve  [relationship  (22.16)]  /434 
in  the  general  case  provide  the  same  set  of  systems  which  will  comprise  all  non-worse 
systems,  and,  in  addition,  there  may  be  several  worse  systems.  The  method  based 
on  minimization  of  weighted  sum  (22.18),  on  the  other  hand,  may  (but  this  is  not 
mandatory)  lead  to  a  loss  of  part  of  the  non-worse  systems,  but  does  not  provide 
a  single  worse  system. 

The  drawback  of  the  latter  is  more  significant  since  worse  systems  usually 
may  be  discovered  without  any  special  effort  and  screened  out  during  subsequent 
system  design;  missing  some  of  the  non-w  or se  systems  may  not  be  counterbalanced 
and  system  design  results  may  deteriorate.  Therefore,  in  those  cases  when  use 
of  the  first  method  provides  success  in  minimization  without  special  difficulties, 
its  use  is  preferable.  However,  when  meeting  several  of  the  aforementioned  ad¬ 
ditional  conditions  [strict  performance  curve  monotonicity  or  characteristic 
k^c  s  fc0<2C)  coherence],  b°th  methods  make  it  possible  to  find  the  population 
of  non-worse  systems  with  the  cfanger  of  losing  part  of  the  non-worse  systems  or 
contaminating  the  population  of  worse  systems  found. 

22.3  System  Design  Based  On  Three  Quality  Indices 

In  a  case  of  system  design  based  on  three  quality  indices,  general  relationships 
(22.12)  for  non-worse  systems  take  the  following  form: 


lira  ™  fi  ^i)» 

kt  ma  “  ft  (^t» 
*mw™ 


(22.28) 


Figure  22.7 

apace  in  Figure  22.7).  Evidently,  this  surface  is  a  generalization  of  the  concept 
of  the  lower  left  boundary  a  case  of  three  quality  indices.  Each  surface  point 
has  the  four  properties  indicated  in  $  22*1. 

We  will  limit  ourselves  to  examination  of  cases  in  which  the  optimum  surface 
is  continuous  in  the  entire  range  of  possible  (or  of  interest  to  us)  quality  index 
values.  Then,  considering  the  properties  of  non-worse  systems,  it  is  possible 
to  become  convinced  that  no  single  point  (system)  from  set  Af,  of  all  possible 
aystem8  exists  between  the  origin  and  the  optimum  surface.  Therefore,  the  optimum 
surface  will  belong  to  the  external  set  Af,  boundary. 

We  will  introduce  the  concept  of  an  operating  surface  for  further  examination. 
The  operating  surface  is  that  population  of  points  possessing  the  minimum  value 
of  one  quality  index,  given  fixed  (but  random)  values  for  the  other  indices  in 
the  entire  range  of  possible  (or  of  interest  to  us)  quality  index  values.  /435 

Then,  depending  on  which  of  the  three  indices  is  subject  to  minimization 
(as  the  operating  surface  is  formed),  we  will  have  the  following  operating  surfacest 
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a)  Primary  operating  surface 


b)  Secondary  operating  surface 


MR*  — /ip  (£*.  *,); 
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(22.30a) 


(22.30b) 


c)  Tertiary  operating  surface 


^ixn — /* p(^i*  ^t)* 


(22.30c) 
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It  follows  from  comparison  of  expressions  (22.28)  and  (22.30)  that  each  operating 
surface  will  comprise  all  points  of  the  optimum  surface  (i.  e. ,  all  non-worse 
points)  and,  in  addition,  may  comprise  several  additional  (worse)  points. 

Hence,  in  particular,  the  following  conclusion  important  for  practice  emerges: 
if  during  system  design  one  provides  minimization  only  based  on  one  quality  index, 
given  fixed  (but  random)  values  for  the  other  two  indices,  then  this  will  not 
lead  to  a  loss  of  any  of  the  non-worse  points  (systems). 

In  view  of  the  symmetry  of  relationships  (22.30a)— (22.30c),  it  is  possible 
in  future,  without  a  loss  in  generality,  to  restrict  ourselves  to  examination 
only  of  a  primary  operating  aurface.  The  equation  for  this  surface  may  be  represented 
on  a  plane  in  the  form  of  either  of  the  following  two  families: 


A*b); 


(22.31a) 


b) 


(22.31b) 


Here,  the  index  <n»  denotes  which  of  the  two  function  (22.30a)  arguments 
plays  the  role  of  the  parameter  of  the  family  of  characteristics  (Figure  22.8). 

We  will  call  these  families  type  I  and  II  performance  curve  families,  res¬ 
pectively.  It  may  be  demonstrated  that  performance  curve  families  have  the  following 
properties: 
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1.  Each  family  will  comprise  an  entire  set  M„s  of  non-worse  points  (systems) 
and,  in  addition,  may  comprise  several  additional  (worse)  points  (systems). 

2.  A  necessary  and  sufficient  condition  of  complete  operating  surface  (per¬ 
formance  curve  family)  coincidence  with  the  optimum  surface  may  be  formulated 

in  the  following  way:  all  type  I  and  II  performance  curve  families  [(22.31b)] 
must  be  monotone  nonincreasing  functions  in  the  entire  range  of  possible  /436 
(of  interest  to  us)  quality  index  values.  In  practical  problems,  the  condition 
is  met  as  a  rule  (but  not  always). 

The  following  consequences  important  for  practice  stem  from  these  properties: 

1.  It  is  possible  when  seeking  the  population  of  non-worse  systems  (the 
optimum  surface)  to  use  the  determination  (computation)  of  the  primary  (secondary, 
tertiary)  operating  surface,  i.  e.,  to  find  a  (22.30a)-type  dependence.  This 
operating  surface  (and  the  type  I  and  II  performance  curve  family  corresponding 
to  it)  will  comprise  all  the  non-worse  points  (systems)  and,  in  addition,  may 
comprise  several  worse  points  (systems),  which  must  be  screened  out. 

2.  If  the  type  I  and  II  performance  curves  corresponding  to  the  operating 
surface  are  monotone  nonincreasing  curves  in  the  entire  range  of  possible  (of 
interest  to  us)  quality  index  values,  then  they  will  not  comprise  worse  points 
and,  consequently,  no  screening  is  required. 

3.  When  meeting  the  strict  monotonicity  condition  in  the  preceding  paragraph, 
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each  point  (system)  of  the  performance  curve  family  is  non-worse  and  has  all  the 
aforementioned  properties  of  non-worse  points  (systems),  the  following  property 
in  particular: 

a)  for  the  given  index  l<2  and  kj  values,  it  provides  the  min imum- possible 
(among  all  possible  systems)  index  k^  value; 

b)  the  minimum  k2  is  supplied  for  the  given  kj  and  k^; 

c)  the  minimum  k^  is  supplied  for  the  given  k^  and  k2* 

We  will  examine  as  our  example  design  of  a  binary  detector  system  for  unknown 
a  priori  signal  presence  and  absence  probabilities. 

In  this  case,  mathematical  system  design  requires  consideration  of  three 
quality  indices:  signal  miss  probability  Pn 9  ,  false-alarm  probability  /437 
Par  ,  and  requisite  signal-to-noise  ratio  q.  The  less  the  value  of  these  indices, 
the  better  the  system.  Therefore,  it  is  possible  to  designate 

(22.32) 

It  follows  from  what  has  been  stated  that  it  is  possible  during  the  system 
design,  without  cfanger  of  losing  optimum  decisions  (non-worse  systems),  to  minimize 
any  of  these  indices,  assuming  that  the  rest  are  fixed  (but  random)  magnitudes. 

When  using  the  Neyman-Pearson  criterion  examined  in  Chapter  14,  magnitude 
Pn  is  minimized  for  fixed  values  of  Pm  and  q,  i.  e. ,  the  operating  surface  equation 
will  be  found 

(22.33) 

and  the  performance  curve  (22.31a)  and  (22.31b)  families  corresponding  to  it. 

The  type  I  and  II  performance  curve  family  of  the  first  type  qualitatively  is 
depicted  in  Figure  14.5,  while  it  is  depicted  quantitatively  in  Figure  9.6  (for 
a  random-phase  signal).  However,  magnitude  (1—  k^),  rather  than  k^,  is  plotted 
on  the  X-axis  in  these  figures.  The  family  of  Figure  9.6  performance  curves  plotted 
in  coordinates  (k^,  k2?  has  the  form  depicted  in  Figure  22.9a. 

Baaed  on  this  figure,  it  ia  not  difficult  to  plot  the  corresponding  type  II 
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Figure  22.9 

performance  curve  family  depicted  in  Figure  22.9b.  Ae  is  evident,  any  curve  of 
these  families  has  a  monotone  nonincreasing  nature.  '  Here,  all  non-worse  points 
(systems)  differ  only  in  threshold  bias  UQ  magnitude. 

The  performance  curves  depicted  in  Figure  22.9  correspond  to  a  signal  with 
known  amplitude  and  random  phase.  For  a  fluctuating  signal  where  Pap  <  o.i,  the 
operating  surface  is  determined  from  relationship  (9.83),  from  which  it  follows 
that  (where  Pap  <  o.i) 


Papas 


lad  /Pm 


where  QopW,  . 

In  this  case,  it  is  evident  even  without  graphic  representation  that  magnitude 
Pap  is  monotone  nonincreasing,  both  when  Par  rises  (where  q  =  const)  and  when 
q  rises  (where  Par  «  const),  i.  e. ,  type  I  and  II  performance  curves  have  a 
monotone  non increasing  nature. 

Thus,  the  necessary  and  sufficient  condition  of  complete  operating  surface 
coincidence  with  the  optimum  surface  is  met  in  all  the  cases  examined.  This  signifies 
that  a  family  of  performance  curves  of  any  type  (i,  e.,  types  I  and  II)  will  comprise 
all  possible  non-worse  points  (systems)  and  will  not  comprise  a  single  worse  point. 
Hence,  the  following  important  conclusion  stems  from  consideration  of  property 
4  of  non-worse  systems. 


A  detector  system  designed  in  accordance  with  the  Neyman-Pearson  criterion  /438 
provides,  not  only  the  minimum  magnitude  Pn p  value  for  given  q  and  pat,  but  the 
minimum  Pur  value  for  given  q  and  Popand  the  minimum  q  value  for  given  Pur  and 
Pap  .  Moreover,  it  follows  from  the  aforementioned  that,  in  spite  of  the  fact 
that  system  design  is  accomplished  through  minimization  only  of  one  of  three  quality 
indices  (magnitude  Pap  )  when  the  Neyman-Pearson  criterion  is  used,  results  obtained 
may  not  be  improved  if  all  three  quality  indices  (  P*T  ,  Pap  ,  and  q)  are  considered 
together  during  system  design  (selection  of  the  best  system). 

We  examined  the  method  of  finding  the  optimum  surface  (non-worse  system  pop¬ 
ulation)  by  finding  the  operating  surface  equation.  However,  as  was  true  in  the 
case  of  two  quality  indices,  there  are  other  approaches  to  finding  the  optimum 
surface.  In  particular,  it  is  possible  when  meeting  the  corresponding  convexity 
conditions  to  use  the  quality  index  linear  sum  minimization  method 

™  + ei  *t + ct  kf, 

where  weights  c^  and  c^  range  from  0<C|,*<  oo. 

In  conclusion,  we  must  note  that  the  theory  of  radio  system  design  based 
on  several  quality  indices  is  only  in  the  initial  stage  of  development  and  the 
material  presented  in  this  chapter  should  be  looked  upon  only  as  an  introduction 
to  this  theory. 


APPENDIX 


|Jj  xa«U«>  table  of  distribution 


i  ^ 

«.» 

•.  M 

Ml 

B9 

0.00 

*.76 

I.M 

1.11 

Ml 

0.10 

O.M 

0.00 

0.01 

HP 

mm 

o.oox 

0.001 

* 

0.020 

ESS 

0.109 

m 

0,713 

m 

2.41 

3.22 

4.60 

5.99 

7.82 

9.21 

11.6 

12.4 

13.8 

:j  4 

0.297 

0.429 

0.711 

m 

SEE 

2.20 

3.36 

4.88 

5.90 

7.78 

9,49 

11.67 

13.28 

14.9 

16.9 

13.5 

i  6 

0.372 

1.134 

EE 

2,20 

3,07 

3.63 

5,36 

7.23 

8.56 

10,64 

12,59 

15.03 

16.81 

18.6 

20.7 

22.5 

■a 

1.640 

2.03 

2.73 

3,49 

4,59 

m 

7.34 

9.52 

11.03 

13,36 

15.51 

18,17 

20.1 

21.9 

24.3 

26.1 

5  10 

2.S6 

3.06 

3.94 

4.86 

6,18 

7.27 

9.34 

11,78 

13.44 

15.99 

18,31 

21.2 

23.2 

25.2 

27,7 

29.6 

%  12 

3.57 

4.18 

5,23 

6,30 

7,81 

9,03 

11.34 

14.01 

15.81 

18,55 

21.0 

24.1 

28.2 

28.3 

31 

32.9 

4.66 

5.37 

6.57 

7.79 

9,47 

10.82 

13,34 

16.22 

18,15 

21.1 

23.7 

26.9 

29.1 

31 

34 

36.1 

H  JL 

5.61 

6.61 

7.96 

9.31 

11.15 

12.62 

15.34 

I3m 

20.5 

23,5 

28.3 

29.6 

32.0 

34 

37 

39,2 

1  » 

7.02 

7.91 

9,39 

10.88 

12,66 

14,44 

17.34 

20,6 

22,8 

26.0 

28.9 

32.3 

34,8 

37 

fm 

42.3 

K 

5.26 

9.24 

10.66 

12,44 

14.58 

16,27 

19,34 

22,3 

25.0 

m 

31,4 

35.0 

37.6 

40 

43 

45.3 

0.54 

10.60 

12.34 

14,04 

16.31 

18,10 

21,3 

24,0 

m 

30.8 

38.9 

31.7 

40.3 

42,5 

46 

48.3  ! 

10.66 

11.99 

13.85 

15.66 

18,08 

19.94 

23,3 

HI 

29.6 

38.2 

36.4 

40.3 

43.0 

45.5 

48.5 

51.2 

§^H”i 

12.20 

13,41 

15.36 

17.29 

19.82 

21,8 

25.3 

29.2 

31,8 

35,6 

38,9 

42.9 

45.6 

48 

51.5 

54.1 

9- 

13.86 

14.65 

16.93 

16.94 

21.6 

23,6 

27,3 

31.4 

34,0 

37,9 

41.3 

45.4 

48,3 

51 

54.5 

56.9 

1  30 

14.96 

16.31 

18.49! 

20.6 

23,4 

29.3 

33,6 

ol 

40.3 

43.8 

48.0 

60,9 

54 

57.5 

59.7 

i 

■fen 

629 

BIBLIOGRAPHY 


I.  KOTIIMMOl  B.  A.  TeopRH  nOTeHQaMbHOfi  IlOMftXOyCTOftlHSO- 
cth.  rooBeproaaxar,  I960. 

‘  2.  B  y  x  a  o  p  x  <9.  M.  TeopRH  aepoaTHocjeA  ■  reopaa  HwpopMMa  c  npa- 
MCRCBHRIIR  •  paXROXORaRRH.  top.  C  lira.  HSX-BO  tCOMTCROO  puaot,  1958. 

8.  Jl .  A.  i3ytxoi  B.  A-  BuxexeaHe  carnaaoa  aa 

4om  cxyeaftnu  bomcx.  Hu-i a  cCoaereRoe  pexao*,  I960. 

4.  XipiiiRi  A.  A.  OacpxR  o<5meft  TeopRH  cbrsh.  rocrexRUaT,  195$. 

8.  OixtxoiRi  G  B.  upaeu  paxaojoxitinoHaux  carnaxoa  aa  ^.oe 
(payrrviHRoaaux  nonex.  Hax-ao  tCoaercaoe  p  exact,  1961. 

6.  Aoayxaxpa  M.  n.  Bbcxchuc  a  reopaio  nepexaiR  awpopitaiiaR  no 
MCKTpaqecxNM  mimim  caaiR.  Caxahiurr,  1958. 

7.  C  m  R  p  h  o  a  *B.  A.  Ochobu  paxRocaaaa  aa  YKB.  CaxsaHXXaT,  1957. 

8.  roikxaaa  G  Teopaa  hr$o;:  Manna.  top.  e  aari.  Hsx-ao  Haocrpaa- 
Hop  aarepaTypti.  1967. 

9.  «noporoaue  carauu*.  top.  e  earn.  Hax-ao  tCoaercaoe  pexao*,  1952. 

10.  U  a  a  a  C  »  a  • «  a  a  a.  Texaaaecaaa  aaOepaeTaxa.  top.  e  aara.  Hax-ao 
aaocrpaaaoft  aarepatypu,  1956. 

II.  tlyrini  B.  G  .Teopna  cxycaftHM*  tyaanaA  a  ee  npaMeaeaae  a  aa- 
xaeaM  aaroMaTaaecKoro  peryXapoeaaaa.  rocrexaaxaT.  1957. 

12.  Coaoxoaaaaoa  B.  B.  Baexeaae  a  craracTatecnya)  xRaaMaay 
cacrea  aaroperynapoeaaaa.  roaexaaxat,  1958. 

13.  n  e  p  o  a  B.  n.  OravaennacRRA  caarea  Rany/ibcaux  crctcm.  Hax-ao 
‘tCoaercaoe  paRnot,  1989. 

14.  Baeiyaia  A-  *  T  a  p  m  a  a  M.  A.  Teopaa  un  a  oraracnieecKax 
pemeaRt.  nep.  c  aarx.  Hax-ao  aaoerpaaMl  aareparypu,  1958. 

15.  *C0oBrrr  rpyxoo  aayaao-rex aaeecaoro  o6-aa  paxaorexHaaa  a  axexrpo- 
caaaa  rm.  A.  C.  IloOoaai.  Ilex  pax.  B.  H.  Ca+opoaa.  Bun.  II,  III.  roc»HeproR3XaT, 
1958,  1959. 

16.  tnpjNM  chtrxxob  opH  RMaaaH  myita>.  Ctfopanx  nepeaoxoa  nox  pea. 
Jl.  C.  ryraaaa.  Hax-ao  RHOcrpaHHoA  JtnreptTypw,  I960. 

17.  tTeopRR  RHtopMaiRR  R  ee  npaaoxeaRn*.  Ctiopnaa  nepeaoxoa  noa  pex. 
A.  A.  X  apace  Rea.  ®ejnrrrne,  1989. 

18.  tTeopRR  ncpeXaea  aatarpateceax  caraaaoa  npa  HMaeaa  none**.  Cfiop- 
rrk  nepeaoxoa  box  pex.  H.  A.  JKejteeaoea.  Hu-ao  aHocrpaaaoft  xaTeparypu,  1953. 

19.  tnpRCH  RMRyxbCRUX  carnaxoa  a  npacyrcTaaa  uymob».  Ctfopana  nepeao- 
xoa  nox  pax.  A.  E.  Bamapaaoaa  a  M.  G  Axeacaaxpoaa.  foceaepronexer,  i960. 

20.  B  y  I’la  o  a  B  e  B.  H;  •xyaTyauaoBaiie  nponeccu  a  paxaonpaeMaux  . 
ycrpoftcraax.  Hax-ao  tCoaercaoe  paxROt,  1981. 

21.  iluiR  B.  P.  TeopRH  cxyeaiRUx  npoaeccoa  x  ee  npRiMBeaHe  a  pa- 

XROrexRHRe  Hax-ao  tCoaercaoe  pexao*,  i960.  .  **e  o»» 

22.  n  e  0  e  X  a  a  B.  Jl.  Cayeaflawe  npooeccu  a  MeaTpaeecaax  a  uexaea- 

eecRRX  cacTcttex.  •aaMtmia.  1988.  toakp*«r 

23.  R  b  a  m  a  p  T.  MaTtReraeecaae  mctoxu  craracTaRa.  top.  c'aarx.  Hax-ao 

aaocTpaaaoa  xaraparypu,  1948.  .  o 

24.  Caapaea  H.  B.  a  Ay  a  a  R  •  B  a  p  r  o  a  q  x  r  ft  H.  |B.  Kparaaft 
acgrgc  wareMaTHeeexoft  crrrncrann  x*R  rexRaeecxax  itpRxoaceaRft.  ♦nmerraa, 

25. raexcRRO  B.  B.  Kype  TeopRH  atporraocreft.  rocrexRSxer,  1950. 

•  28.  BeRTxexa  E.  G  Teopaa  aepoafBoereft.  (OaiMttraa,  4958. 


/440 


630 


97. '  Pm*  1 1  H.  M.  a  rpiimTdN  II.  O.  Ts&xaau  NirrerpsnoR, 
cym.  oaxoe  *  npoaseeieanA.  rocrexHSxar,  1951. 

98.  W  i  •  n  A  r  N.  Extrapolation,  interpolation  and  smoothing  of  stall* 
onary  time  aeries.  John  Wiley,  New-York,  1949. 

99.  B  a  m  b  g  A.  nocaexoMTexsHuA  ananas.  ♦HSMairm,  I960. 

30.  W  a  I  d  A.  Statistical  decision  functions.  John  Wiley,  New-York. 

1947. 

31.  Miuitsi  A.  Baexeaae  a  CTarncrnRecayio  Teopnio  esasn.  T.  1  n  II. 
HsA-ao  tCoMTCxoe  puma,  1961,  1969. 

32.  Aaseuuopr  B.  B.,  Pvt  B.  Jl.  Beexenm  a  Teopatocxyealaux 
cnrauoa  h  myna.  Hu-ao  anocrpaenoA  Jinreparypu,  I960. 

33.  Kotissiiioi  B.  .A.  O  nponycaaoA  cnocodnocra  espnpa*  a  npo- 
aoaoaa  a  saerrpocaasa.  Bceeososa.  saepr.  bom.,  Marepaasw  a  nepaooy  acecotosao- 
My  ceesxy  no  soap,  peaoacip.  Mae  cease.  Hu.  ynp.  cease  PKKA,  1933. 

34.  K  oa  m  o  r op  op  'A.  H.  Harepnoaepoeaaae  a  saerpacoaapooaaae 
craaaoaapaux  cayeeAaux  aocaeiooareaseocTeA.  Hseecma  AH  COCP.  cep.  mrt«- 
MSTaaecKaa,  1941,  t.  S,  M  I. 

35. Capopoe  B.  H.  O  aaaaaaa  none*  aa  npacn  nunyabcnux  paxno- 
caraaaoe.  «Pa«aomaaaaa,  1946,  Ml. 

36. Beaoyeoe  A.  fL  O  naaetscnieA  peaasaoA  eyecTsanaenocTa  npn- 
enaaaa.  «P«iaorexaaaas.  1946,  M  5. 

37.  KoTeabaaaoa  B.  A.  IlpodaeMU  noMexoycroAeBBOcm  paxno- 
esasa.  PaxaoTexaaeeenA  cOoonna.  rocs  aepro bust,  1947. 

36.  Xeiseigs  A.  A.  Odaapyxeaae  caaOux  caraaaoe.  tPaxaoTexnn- 
aa*.  1983,  M  6. 

39.  KapaoeeaaA  M.  H.  O  noxaaneaaa  ihayrryanaoBBtax  nones  npa 
aopp— aanoaaoti  neroae  apaena.  «PaxaoTexaaaa>,  1954,  M  3. 

40.  BoaaxaA  B.  C  Odaapyxeaae  caa6m  caraaaoe  cnocofion  acaa* 
xpoaoera  aaaoaaeaaa.  •PaxaorexHRaas,  1954,  MS. 

41.  ♦  a  »  ■  a  ■  •  a  Jl.  H.  O  nooexoycToAmtBocrn  nnnyskcaoro  paxao- 
apaonju  ^aaaeeexana.  1954,  «k  6. 

4*.  V  p  a  o  a  a  a  r.  Baaerpu  xaa  efisapyaceaaa  caa6ux  paxaojioaanaoa- 
asis  raraaaaa  aa  #eae  Mamma*  orpaaceaaA.  tBonpocu  paxaoaoaanaonaoA  rex* 
ao aw,  1964.  Ml 

4A*a*aaaee  JL  H.  nonaanaaeasa  noMexoycToAaaeocre  npa  npae- 
mb  aooyauBaia  pern eeaiaeaee.  «Paiaorexanxa>,  1955.  M  10. 

44.  BsiTeea  r.  A.  Hocaeaooaaae  aOMexoycToAaaaocTa  nepexaaa  Bo¬ 
nsai  Toaeyapaaaoaaa  Metoaasia  reopaa  noreanaaaaaoA  noMexoycToAaaaoera. . 
•AasowraM  a  reaaimaaaoaa*,  1995,  M  4. 

49,  ailxoilial  B.  H.  npaen  RMnynwcnux  caraaaoe  no  neroxy 
Busasal  aoppaaaaaa.  sPeanorex ss aw>  1985,  M  6. 

46.  Boaatp  A.  a A  p  e  a  a  a  P.  tlpoAseMS otrraMaabaoro  odaapyaceeaa 
■Mayaeeasse  caraeoao  a  am.  eBoapocst  paxaoaoaanaoanoA  Texaaaat,  1956,  M  5. 

47.  HiAasetaal  B.  H.  nomxoycroAaaeocn,  pnatTpoaoro  asroxop- 
poaaaaoMnaro  apomaaaa  awayasraiix  caraaaoe.  «Paxaoiexaaxa»,  1956.  M4. 

49.  Isaiiaate  A.  B.  O  BOMexoycroAsaeocra  aoppexaaaoaaoro 
momm  apmsa.  Jhmmmmmm,  1996,  MS. 

49.  3  •  a  •  A.  r,  (laneacpt^dKaaoctk  a  e##eara»aocT>  lotonaerpapaoA 
paxaecaasa  apa  ♦ayaryeaaaaaiu  aonexas.  «Paxaor«xaaaa»,  1956,  M  9. 

90.  3  m  a  a  A.  t.  Aatee  ipipir  o  paeeere  nomxoycToAeasoeTa  puao- 
naaeMaaaa  apa  leaemsn  ypaeae  payaryaaaoaawx  aowex.  sPaiaoTexeanas,  1986, 


/441 


none!  npa 


41  Beaaep  A.  a 

,I47llH»l"si"i,|l1 


3  »  a  •  A.  f . 

i  apa  6obmmm  ; 


51.  X  a  p  x  e  a  a  *  A.  A.  0  emaeaeaaa  eaearpoe  ofyeaAeux  apeaweae. 
tPaxaorexaaaas,  1997,  Ml 

.  52.  Xexeiati  H.  A.  O  apaaaaaaeaeaux  eoapoeax  reopaa caraaaea 
a  saxaesx  ce  luuriasre  Manama  aa  ocaoae  aoeoA  CTOxacraeecaeA  aeasaa 
(PaAnorexnaxa*,  1987,  M  1 1. 

63.  KapsosMcaaA  M.  H.  a  MaAkoacaaA  B.  Ik  Ami 
noetnaeaaa  ncMexeycroleaeoera  anroneppoaanaoaaoro  npaewa  amayascaM*  ear- 
aaaoa.  sPaxaeresaanas,  1997,  Ml 


7 


54.  Caepa.aar  n.  Maacatuabnia  Tonnocrb  onpcaaataaa  noopuur 
■MoyabCMoA  puaoaoaaaaoHnoA  crtiiuRB.  tBonpocu  ptABoaoManoano#  manna*, 
1957,  M  2. 

56.  T  •  n  a  o  ■  H.  71.  K  obabko  noMtxoycTOltaadcra  nemoo  pumipm- 
Ma,  ochombbux  m  ycpcABeRRR  (pyHxnaA  carHtaa  a  nontax.  «PaxaoTtxaxaa*, 
1957,  M9. 

56.  «  a  h  k  e  x  uu  t  e  A  a  M.  H.  riepcxoARUt  nponeceu  ■  rpeSeanaTux 
<paaaTpax.  «PwROTexaaaa»,  1957,  M  7. 

57.  9  y  p  a  ye  a  B.  B.  0  Hexoropiix  maamax  Ttopaa  earaaaoa.  «Piabo- 
TexHHxa*,  1957,  In  4. 

58.  3  adept.  06mae  saxoaoMepnocTa  oflaapyxctxxa  aaatA  apa  nomosnn 
paAaoaoxanaa.  tBonpocu  paxBoaoxanaoaaoft  Ttxaaam,  1957,  Mb  5. 

59.  Xapxtaxa  A.  A.  05  oahoA  exam  npaaua  earaaaoa.  «3aerrpo- 
caaaat,  1957,  Mb  2. 

60.  X  a  p  x  e  a  a  a  A.  A.  "O  reoperxneexa  omanaMaaft  cacrtna  caaaa. 

tSaexTpocansk*,  1967,  Mb  6.  . 

61.  XipxeiM  A.  A.  0  aosMoacaocrax  axtTxa  eaaxrpa  earaaaa.  tSatx* 
Tpocaxm,  1967,  Mb  4.  > 

62.  OixkioiM  C  E.  O  Toaxocra  oreatra  aoopxaaani  xanaocra 
a  paAaoaoaaaaoaaiix  cacranax  npa  aoxoreptaraoM  xaxoaataax.  nPaiaoraxxau 
X  MtXTpOBKKU,  1957,  MS. 

63.  OatlnRai  B.  C  05  oaraiwjiaaoM  Atrtxropa  e  Ion  IrxaptRTt- 
pacraaofl  xaa  oSaapyxceaaa  caaSoro  earaaaa  apa.  aaaaaaa  myma.  tPaiaor axaaaa 
a  aatmpoaaaaa,  1957,  MS. 

64.  71 1  o  a  o  a  K).  IL  i  Tubieaie  71.  n.  Oaaaaa  aapautTpoa 
aaaoaa  pacapaaeaaaaa  eayaalaot  fyaaaaa  apa  orpbaaaaaaaia  tapaapnui  aaa- 
aax.  cAaroMaraaa  a  leaonxaaaaaa,  1987,  Mb  11. 

.  (5.  Xipatiiaa  A.  A.  0  Taopaua  Koraavaaaoaa.  tPmoraaaaa*, 
1985,  M8. 

66.  TypOosaa  H.  T<*X  aonpoey  o  opaneaeaaa  Ttapaim  Kaaaawmaoaa 
a  ^yaaaaaM  apcwRn  c  orp  uneaauu  cnaatpow.  aPaxaeiaxaaaat,  1985,  M  8. 

87.  Bomaaaft  B.  C.,  Cajraioiexil  A.  It  Cxtuajux  asNtpt- 
xxx  eaaSMX  earaaaoa  co  cnaomxuu  cnrrpoM.  tPaxaemanaat,  1965,  M  9. 

65.  M  a  m  xo  itial  K.  A.  Boapoeu  nouaioycTOfttaaacra  caciew  cam, 
aeymaetaaamaax  apatn  earaaaa  a  tataout.  cPaxaoraxaaaa*,  1968,  M  6. 

89.  II  o  a  a  a  10.  B.  aKaaaaoa  B.  C.  X  Ttopaa  oSaapyattaaa  napa- 
oixaaexax  aaayawaux  earaaaoa  a  rayecosoM  amat  apa  aoaoropoaraoM  aaxon- 
ataaa.  cPaxaortxaau  a  tacarpoaaata,  1958,  Me. 

70.  Tapacoaao  4b.  II.  05  nn$op*irrniHOCTR  napauerpoa  npaaRMtt* 
aoro  earaaaa.  aPnaotaxaaaa  a  MaXTpoaaxa*,  1958,  M  4. 

71.  7m  a  a  a  X>.  C  O  eaxrtae  jmabTpoa,  onrxuaabRUx  XHnyaacaM  onpe* 
Ataeaaol  fSpuu.  Hayaaua  xoxaaiu  BucmtA  mxoaw,  cepaa  «Paxaortxaaaa  a 
taeRTpORRRta,  1988,  Mb  8. 

72.  Ca^opoa  B.  H.  0  aponycaaoft  enoeoSnocnt  aaaaaoa  cam  e  mi* 
aaaauua  cayuiauMX  amtataanux  napanrrpoa.  HaytBut  xoxaaxu  BucmtA  mao* 
au,  enn  aPaiaoTaxaaxa  a  aaamoaixat,  1988,  M  3. 

73.  ♦aaaaaoa  71.  H.  raaaaaaoa  paiaonpatMaot  ycrpoAcrao  auaaat* 
aaa  earxaaoa.  Hayaaua  loaaaiu  BucmtA  maoau,  cepaa  aPtAaortxaaaa  a  sata* 
rpoaaxat,  1968,  M  2. 

74.  JKa  a  «  a  a  o  a  H.  A.  Tlpanaan  lacaperataana  eroxacrattcxax  ear* 
aaaoa  e  atorpaaattaauu  entarpou.  nPaiaortxaaaa  a  aatRTpoaaaat,  1958,  M  1. 

75.  3  y  6  a  o  ■  B.  A.  Obtxmmiiboc  oSaapyatenaa  apa  aopptaapoaaaaux 
aomxax.  nPaxaonxaaaa  a  taaarpoaaaa»,1958,  M  12. 

•  78.  Un oaaaaa.  OSaapyxtaat  aunyabcaux  cxraaaoa  x  myuax.  «Bo* 

opocu  paiaoaoxaaaOKaoA  maaxa*,  1956,  M  3. 

77.  M  x  a  a  op,  Btpamroia.  Ttopaa  xartptxTRoro  aararpxpoaaaax 
aw  apamaaaaa  a  oonapyannaio  earaaaoa.  tBonpocu  paixoaoxasHOxxoA  maxim, 
1958,  MS 

78.  BaaeSaar.  nocauoaarcataot  oSaapyxttaat  a  rayeeoaou  mym. 
tBoopoeu  paiaoaoaauoaaol  maaxm,  1956,  M  4. 


/  442 


632 


r .* .  ■  ---■<.' 


79.  flolpyaiH  P.  71.  Oahi  craTMCTNiecKan  auaia  reopHa  oSaapyrnr- 
aaa  cm-Huta  Ha  (pOHe  uiyMa  a  MHoroaaaaabHoA  cacreae.  cTeopaa  aepoarHocTR  n  re 
npHMetwHHHt,  1959,  M  2. 

tO.  Xapxtui  A.  .A.  OnomaMMe  oflpasoa.  «PaAHOTexaHKa»,  1959, 

MS. 

81.  9xxxtxiuTtti  M.  H.  OnTHMajibHue  noaocu  nponycxaaaa 
aacror  rpeteHaarux  (paxhtpoa.  (Piahotcx hr as*,  1959,  M  I. 

S2.  9iakxoiM  C.  E.  HeKOTopue  peayabTiru  npHMeHCNRH  xtetoxa 
anocrepHopHoft  aepoarHOCTH  k  waxiM  npoeKTMpoaaHHR  paAHoAoKaaaoHHWx  ch- 
creM.  tPaxHOTexHHxa  h  aaeirrpoNHxa*,  1959,  M4.  . 

83.  3  y  6  a  k  o  a  B.  A.  OOaapyaceaHe  carKaxoa  Ha  <|>oh«  hopmxjuhux  myMoa 
h  xaoTHaecKHX  orpaxeHHfl.  <PuNorexMHKa  h  rniearpona aa*,  1959,  M  I. 

84.  3  y  6  a  x  o  a  B.  A-  OOaapyaceHite  aorepeitTHHX  caraaaoa  Ha  <poite  xop- 
peanpoaaHHUx  nowex.  iPMRoraxHHxa  h  mcrtpohhr8>,  1959,  M4. 

85.  Baoapxxoa  A.  E.  xOxetoiMax  B.  C  npamaeaaa  aero* 
ia  nocacAOBarexaRoro  axuxM  a  cHcreMax  xayxiHiaaoA  nepexaaa  npa  pexeeacxHX 
-^ayKTyaoHxx  rmtchchbrocth  carnuoa.  aPuBorexaiiaa  a  aaexTpoaaaaa,  1959, 

86.  Cparoaaa  B.  I*.  06  eirrHiuaaaoM  oOaapyaccaaa  caraaaa  aa  Aoae 
KoppcaapoaaaaoA  rayccoaoA  noMexa.  (PaAaotexaaaa  a  aaeirrpoaRxaa,  1959,  M  5. 

87.  BiIxdtiIi  71.  A.  PaiaoxoaanaoHaoa oOaapyxaaae mepaaawtero 
o6aena»  aa  <pOHe  aoppMHpoaaann  nottex.  tPaxaoTexaaaa  a  amarpoaBaa*,  1959, 
MS  a  M7. 

88.  Bxaipxxai  A.  B.t  •xtAnxixB.  C.  Cimoxhr  a  M.  A. 
Baaapaaw  Haaonarcaaam  caercMM  e  aayxnoporoaiwR  aaajxBaaropaHR.  «Paxao- 
Texaaxa  a  axeaxpoaaxa*.  .1989,  M  9. 

89.  T  y  f  a  a  a  71.  C  O  aaaaaaaa  aparapaea  oaraiiaaaaocTa  a  aapaopaux 
jvacnpexexcHRA  a  rcopaa  apama  earaajiaa.  <Paaaorexaaaa  a  Merrpoaaaa»t  1989, 

*90.  ♦  a  a  a  71.  M.  O  ooiaaaaMaaoA  noMexoycToAaaooera  apa  saaapaaaax 
caraaaa.  <Pa*aattxaa«a*,  1981.  MO. 

91.  ♦  a  a  a  71.  M.  O  wmaoaaMaoft  iioMcxoycTolaatocTB  apa  aeoapeaeaea* 
aoA  Ana  caraaaa.  tPaAtoroxaHaa*.  1989,  M  I. 

92.  Tapaciaao  9.  FI.  Cpuaaaaa mtoRoa paxaoaoaaaaaaaoro apacaa 
c  toaaa  apaaaa  taopaa  anAopMasaa.  ePaaa<mxaaaa»,  1989,  M  7. 

93.  B  a  pax  a  a  a  p  B.  A.  K  taopaa  napcaaaa  caraaaoa  co  Maoraaa  aa- 
caparauMR  saaacaaaxni.  xPaaaoraxaaaai,  1969,  M  I. 

94.  K  a  a  a  a  a  a  71.  3.  nomxoycieftaaaocTh  aaToaopptaaaaoaaera  apa* 
caaaas  AM  caraaaoa.  «Paxaotexaaaa»,  1989,  MO.  , 

95.  T  y  p  6  o  a  a  a  H.  T.  AaaaaTaaecaoa  apaacraaaaaaa  pyaaaaA  apa* 
aaaa  c  aeorpaaaaaaaiiH  caaarpoa.  iPaxaoraxaaaa*,  1989,  M  3. 

96.  X  y  p  r  a  a  ft.  H-  Oaaaxa  aponycano*  caocoOaocra  aaaoropin  aaaa- 
aoa  caaxa  co  cayaatao '  aaaiaaaaiaiaMHca  aapaMaTpaaa.  «Paaaofasaaaai,  1969. 

12. 

*97.  A  a  t  a  a  10.  9.  rpaOaaaarwA  #aaarp  e  aaaarpeaami  aawwyiaiapoa. 
tPaxaonxaaxa  a  aaearpoaaaa*,  1959,  Mil. 

96.  A  o  6f  y  ax  a  a  P.  -71.  OimwaabRaa  atpoana  aapepiiaaaa  aa  aaaaay 
c  aaaaaacrautia  napatMTpaaa.  tPaaaoraxaaaa  a  aacarpoaaaae,  1919,  M  12. 

99.  UIap4iaa  ft.  A.  Taopaa  oOMpyawaaa  aoacaaora caraaaa  aa  pooa 
rayccoatiR  my  mob  a  npoaaaoaaaoro  cocoa  wamtanaax  caraaaoa  ea  cayaaaaMwa 
aaaaaTyAaaa  a  Maaaaamm^aaaMa.  tPuaotaxaata  a  aaaarpaaaaa*.  1969,  M  12. 

100.  M  a  p  a  a  a  10.  B.  O  aaaoropwx  cnocoOaa  o6pa6oraa  ♦ayarjaayioaxax 
caraaaoa  a  xayxaaaaaaaux  cacraiax.  <Paaaoraxaaaa  a  aatarpoaaaae,  1999,  M  12. 

101.  Ay6aacaa«  B.  A.  K  aoapocy  o  tohroctb  aamptaaa  aaaantrpoa 
aoaaOaaaa,  acaaxaaaoro rayccoaoA  nomxoA  aaaol  aatwcaaaocni.  aPaaaom* 
aaaa  a  aaaarpoaaaat,  1989,  M  12. 

102.  Y  o  a  T  a  p .  Koaaaacraaaaul  aaaaaa  utomitixmxxx  cx«m  otfaapyata* 
aaa  a  oaaaxa  noaoataaaa  aapoanmax  aweA  npa  aaainaa  oiyaa.  «Paxaor*xaaaa 
a  aaaarpoBaaa  aa  pyOaatow,  1989,  M  1. 

103.  3  a  6  a  p  v.  Haaoropua  apamaaaaa  taopaa  oOaapvataaaa  a  paxaoao* 
aaaaa.  cPaaaoroxaaaa  a  aaaatpoaaaa  aa  pyflamow,  1959,  M  1. 


/443 


T.T  4Tfc_or\ 

Vi  * 


/AAA 


104.  K  a  m  h  p  k  h  6.  A.,  til  across  f*.  A.  Bosnojkhocth  nbabituexas 
noMCSoycroianocTH  hs  ocnose  HcnoabSOBSHHa  pacnpexejieiia*  sepoariiocTeft  naps- 
uctpa.  «Abtomst*ks  a  teneuexsHHKSs,  1959,  MS. 

105.  B  o  lo  tt  a  a  ft  B.  C  Q.cpsBHXTenbHoA  noMexoycTOflmsocTa  Aayxxsnsab- 

aoro  KoppexanHOHHOro  npaeMRaxa  c  KBsapSTae hum  xrreitTopoM.  •Psamotcxmik  at, 
I960,  1, 

106.  r  y  t  k  a  a  Jl.  C.  HexoTopue  cooraouwaa*  ■  onrawMbaux  cacTCMax 
oOaapyxeaaa  carBMoa.  fPaAnotexaaxas,  I960,’ M2  a  »4. 

107.  M  e  p  a  a  x  JO.  B.  npa&naxeBauft  Kerax  pacxera  xapaxTepacrax  - 
oOaapyaceaaa  MHoroxaaaxbHUX  ckctcm  c  xoppexapoasanuMR  inyxaMR  npa  oiOope 
aMiuaTyx  no  aaaOonbineMy  saateaaio.  «Paxaorexnnxa  a  saextpoHHKa*,  1960,  M  2. 

108.  Hiti  we  a  B.  H.  00  oornnsabiioM  cnocofte  onpeAeaeaita  apeMea- 
aoro  noaoaceaaa  aunyabcs.  (Paxaorexaaxa  a  saexTpoHHxa*,  I960,  M  2. 

109.  Biaapaioai  A.  E.  OOnapyaccaae  aanyabcaux  naxeroa  cayaafl- 
aoft  npoxoaxrrexbaocTH  yctpoAcTssMn  c  xoaeaaoll  eaxoma  nawaTa.  tPaaaorex- 
aaxa  a  aaentpoauxas,  1960,  M3. 

.110.  Map  a  ax  JO.  B.  OOaapyxeaaa  carnaas  c  aaaaBecTaoft  isctoto# 
a  npoasaoxbaoft  naesabaoi  qnmi  aa  ipoaa  uiyua.  ePaAaotexanxa  a  MaxTpoHnxa». 
I960  M  3. 

'ill.  Maabaaxoa  B.  C  Paaaoc  cnraaaoa  eacTOtnoro  Teaerps$aposa- 
aaa  npa  saMapaaaax  a  axeajibaoM  npaem.  «3mxTpocaasb»,  i960,  M3. 

112.  T  y  f  a  a  a  Jl.  C  3aaacanocTb  napanerpoa  AetexTnpoaaana  a  npeoO- 
pasoaaaaa  aacraru  or  aanxeryx  caraaxa  a  retcpoaRaa.  «PBAnoTexanxa»,  1946,  M  6. 

1 13.  N  o  r  t  b  D.  O.  Analysis  of  factors  which  determine  signal-to-noise 
discrimitation  in  pulsed  carrier  systems.  Rep.  PTR-6C,  RCA,  Princeton,  1943. 

114.  Dwork  B.  M.  Proc.  IRE1,  I960,  v.  38.  p.  771. 

115.  r  p  e  a  a  a  e  p  ^y.  Cayeaftaue  npowccu  a  craracTaaecxae  auaosu. 

Hsx-eo  BRocrpaaaoft  AatepaTypu,  1961.  . 

'  -  116.  Z  a  den  L.,  Ragaitlnl  J.  An  extension  of  Wieners  theory  of 

prediction.  J.  of  Appl.  Phys.,  1950.  a.  21,  M  7. 

117.  Boo  ton  R..  An  optimisation  theory  for  Maw-varying  linear 
systems  with  non-stationary  statistical  inputs.  Proc.  IRE,  i960,  M  8. 

118.  MMiarbaA.  nB-an-Metep  R.  Odaapyamaae  a  aocapoasacxe- 

aae  caraaaoa,  npaaatwx  aa  tone  nyMoa,  c  toexn  speaaa  reopen  erst acta eecx  ax 
pemeaaA.  B  cOopaaxe  nepeaoxoa  tnpnea  annyabcnux  caraaaoa  a  npacyrcraaa 
mynoat.  rocaaeproasnar,  1960.  ,  ,  ,  . 

119.  S  l  e  p  1  a  n  0.  Estimation  of  signal  parameters  in  the  presence  of 

noise.  Trans.  IRE,  POIT,  1984,  M3.  .  .  .  _ 

120.  M  i  d  d  1  e  t  o  n  D.  A  note  of  the  estimation  of  signal  wavefbrm.  Trans 
IRE*  ITe  I960,  M  2. 

*  t21*  B 1  a's  b  a  i  g  H.  Experimental  results  in  sequential  detection.  Trans. 

IRE.  IT,  1959,  M2.  ^  .  • 

122.  Ill  a  n  a  p  o  H.  H.  Paces?  Tpaexropaft  Oaaaacraescaax  caepaaoa  no 
xaaaHM  paxaoaoxanaoaawx  aaOamxeaaft.  Hax-ao  aaocrpaaaoi  aareparypsi.  1961. 

123.  Cpa  roan  e  B.  T.  O  paceere  xapaxTepacrax  odaapyxeuaa  npa 
xeaxparaenoM  cyMMapoeaaaa  caraaaoa.  fPaxaerexaaxa  a  aaexrpoaaxaa,  I960,  M  4. 

124.  31  a  a  a  a  B.  P.  OnrunaabaMe  Qesoeise  Mttewi  aeaapyaieana  caraa- 
aoa.  tPaxaorexaaxa  a  asexiponaxas,  I960,  M  4. 

125.  T  y  t  a  a  a  Jl.  C  Teopaa  oetausabaux  neioaoe  puaoopaetu  npa 
Aayxryanaoaaux  aonexax.  rocaaeproaMtr,  1961. 

126.  Baaapsaoi  A.  B.,  Ofelnaai  B.  C  Memxn cmuentee* 
cxoro  nocasAoeareabaoro  aaaaasa  a  ax  apaaoawaaa.  Hax-ao  rCoeercma  paxaoe, 
1962 

*  127.  B  a  x  y  ?  II.  A.  a  xp.  Bonpocu  ctaractweecxod  teopaa  paxaoaoxa* 
Kxx.  T.  I.  x  II.  HXA'BO  (CoscTcxoe  paxaoe,  1962,  1961. 

128.  X  a  a  c  ?  p  on  K.  Crwacmeeexsa  teopaa  oOnapyamaan  caraaaoa. 
Hsx-so  aaocrpaaaoi  aareparypH,  1961. 

129.  map  na  a  R.  fl. ,  roaxxti  B.  H.  Ocaoeu  teopaa  oOaapyate- 
aaa  paxaoaoxauaoaaux  caraaaoa  a  asneoeaan  ax  napaiseTpos.  Hax-ao.  •Coeercxoe 
paxaoe,  1963. 


634 


H.>*r y^t- t 


,\  _*. 


„«J15a?E?E35S5JL- 


130.  T  a  x  o  a  o  a  6.  H.  CriTRCrRSCcaaa  paxkOrekliRKa.  (Iu-bo  •Oobctck.  • 
p*AHO>,  I960. 

131.  Aeaan  B,.  P.  TeopetaaecKiie  ocaoau  craTiicTtiwcKoA  pajinoTe* 
hrkr.  KHara  BTopaa.  Hax-ao  iCoaeTcaoe  paxao*.  1968. 

132.  Uubkrh  H.  3.  Axanraaaa  a  oSyaeaae  a  aaroMarimecKnx  cacrcMat . 
H3A-bo  (CoaercKoe  paxHo*.  1968. 

133.  Bobbuixoi  H.  A.,  T  y  t  k  a  h  A.  G,  A  t  a  a  a  B.  P., 

CtpiToiom  P.  A.  MarcMaTawcaae  ocaoau  coapeMeaaoft  paxROMexTpn- 
aaaa.  Hax-ao  tCoacrcaoa  paxao*.  1968.  s 

134.  ^axbaoaaa  C  E.  Oueaaa  napaMerpoa  carttaxa.  Hax-ao  «Cobct< 
caoe  paxao*.  1970. 

138.  B*axMaa  A.  E.  Caoacaue  caraaxu  a . npaanan  HeonpexNtcHiiorTit 
a  paxaojioxanaa.  Hax-ao  iCooercxoe  paXao*.  1966. 

136.  71  a  a  a  a  10.  C  .  OarauajibHue  $axbrpu  a  aaKonarexa  MnnyxbCHMx 
caraaaoa.  Hax-ao  <Coarrcxoe  paxao*.  1963. 


137.  Mrxxator  A.  Oacpaa  teopaa  caasa.  Hax-ao  (CoaeTCxoe  paxao*. 

I960. 

138.  Ct  p  a  t  o  a  o  a  a  a  P.  A.  ycaoaaua  uapaoscaae  npoucccu  a  ax 

—  •  r  _  _ _ _ _  •  • ■  aaattr  a 


npliMeataae  a  Teopaa  onTauaxaaoro  ynpaaxcaaa.  Hax.  Mry,  1966. 

t39.  Aaaaatoa  H.  H.  Dpamacaae  reopaa  petueaaft  a  aaxaaaM  o6>ia- 
p^ateaaa  caraaaoa  a  auxcaeaaa  caraaaoa  aa  tuyuoa.  Hax.  BBHA  hm.  Wyxoacicoro. 

140.  BeaTuexb  E.  C  Bacxcaat  a  accxcxoaaaae  oncpanaA.  Ihx-ac 
fCoaarcaoo  paxao*.  1964. 

141.  Max-Kaaca  Rx.  Bbacxeaaa  a  leopaio  arp.  ♦asMaTraa,  I960. 

142.  A  b  »  c  P.,  P  a  A  +  a  X.  Hrpu  a  peawaaa.  Hax-ao  aaocrpaaiioft 
Aareparypu,  1961. 

143.  «B«ckohmruo  aararoaacTaaecaae  arpu*.  Cflopaax  craieA  nox  pex. 
H.  H.  BopoObeaa.  ♦Rauarraa,  1963. 

144.  T  y  t  a  a  a  A.  C,  B  o  p  a  c  o  a  10.  r  a  xp.  Puaoyapaaxcnae  peaa- 

trbhumh  caapaxaua  a  iocmrrccrbmb  annapara  •  Hax-ao  tCooercaoe  paxao*. 
1968.  .  ' 

145.  Bat  ay  i«i  A-.  Tapnax  M.  A.  Teopaa  arp  a  cTaTacraac- 
caax  pcmeaxft.  Hax-ao  aaocrpaaaoA  xareparypu.  1968. 

146.  CaOacTiaa  r.  C.  npoucccu  npaaaTRR  pemeaaA  npa  pacnoaaa- 
bxrbr  oOpaaoa.  Hax-ao  «Texaaaa»,  Kaea.  1965. 

147.  B  a  p  a  6  a  m  K).  A.  a  xp.  Bonpocu  craTacraaacxoA  reopaa  pacnoa- 
aaaaaax.  Hax-ao  aCoaorcaoe  paxao*.  1967. 

148.  A  a  a  a  a  a  K>.  B.  Marox  aaaHeabmax  aaaxparoa  a  ocaoau.  Teopaa 
otfpaOoTxa  aadxioxaaal.  $h3mattbs,  1962. 

149.  Baxauxa  P.  AxaaMaaacaoa  nporpaMMapoeaaae.  Hax-ao  aaocrpaa- 
aoA  xarcparypa,  I960. 

150.  A  a  a  a  a  i  X).  B.  CTaTaeraaecaae  suna  c  MemaiotanaB  napaaerpa- 
mb.  Hax-ao  tHayaa*.  I960. 

151.  A  a  u  a  a  3.  npoacpaa  cTaracraaccaax  ranorea.  Hax-ao  «H  ay  aa*.  1964. 

152.  B  a  a  -x*  p~  B  a  p  x  a  a  B.  A.  ManwaraaKKaa  CTaracraxa.  Ilsx-ao 
aaocrpaaaoA  xarapaTypu,  I960. 

153.  A  a  P.  OirrauMbaue  oueaaa,  onpexcAcaae  xapaRTepacraa  a  ynpaaxr- 
aaa.  Hax-ao  <Hayaa>,  1966. 

184.  CtpiTOiom  P.  A.  npaMtataae  reopaa  upouaccoa  Mapaoax 
aar  oarauMbHOi  AaxbTpauRR  caraaxoa.  «PaxaorexHaxa  a  MearpoaBaa*,  I960. 
MU. 

158.  CTparoaoaaa  P.  A.  Teopaa  onraMaxbaoA  aexaaeAaoA  Oaxb- 
rpauaa  cxyaaAaux  ^yaanaA.  tTaopaa  aepoaraoctcA  a  e*  npaMCHeaaa*.  1959,  M  2. 

186.  B  ox  butaxoa  H.  A.,  Pena  a  B.  T.  Bonpocu  acxaaeARoA 
OaxbTpaaaa.  tAaroMaraxa  a  TexeMexaRaxa*.  1961,  M  4. 

187.  FyTuux  A.  C  noreauaMbaaa  roaaocrb  asMcpeaaa  ■  oaroks- 
aiAbaux  a  HRoroaaaaxbaux  aaiMparexax  napauerpoa  earaaaa.  (PaxRorexaaxa*. 
1964.  M  3  a  4. 


/445 


/446 


158.  CTpiTORSiM  P.  Jl.  OnraMaxaHuA  itpMM  yaaonoxoeaoro earaa- 
aa  c  iteHuecTHoft  nacroroi  n  Aoaa  uyMoa.  (PaxaoTexHHaa  n  MtaTponnaat, 
1961,  Mi  7. 

159.  r  y  t  *  h  a  Jl.  C  Cpaaaennc  peuiii o>  h  noremwaxbHoA  tovhoctcA 
neaeHraaHH.  «Paxaoraxaxaa»,  1985,  M(i  1986,  M  3. 

160.  CrpiToaoii1!  P.  Jl.  Cyotecrayer  xa  reopaa  chktmb  otiTHiiaxb- 
■nix  axanraanux,  caMoo6ynaaMnaxca  axa  caMonacrpaHaaioaiaxca  cacrea?  iAbtom«- 
THxa  a  rexeMexaaaaaa,  4968,  M  1. 

161.  U  u  n  a  a  a  51.  3.  A  ace  ace  cymecrayer  an  Teopna  cnnresa  onTnnajib* 
nwx  axanmaMix  cuctcm?  (AaroMaraaa  a  TMeMexanaaa),  1968,  M  I. 

162.  Ptna  B.  T..  TapTaaoicanA  r.  n.  Axanranaa  cHcreM 
npacaa  a  oOpaforaa  aaAopiiaaaa  a  reopaa  craTacTanecaax  pemenaA.  (AaroMamaa 
n  TcacMcxaaaaa*.  1968,  M3. 

163.  K  o  p  a  A  o  B.  A.  Onanubaoc  oOaapyxeam  cxynaAaux  caraaaoa  aa 
(pone  caynaAmix  no*wx  neaaaecraoA  aanaeaaaeeni  apa  ycaoaaa  nocToancraa  aa- 
poaiuoera  aoacaol  tpeaora.  (Paxaoraxaaaa  a  aaaarpoaaaa*,  1968,  M  5. 

164.  Kop<|0  B.  A.  OaraiiaabaM  otfaapyawaac  xarepaia aapoaa aaux 
caraaaoa  co  cayaaKawia  napanerpaMa  aa  +oaa  ao mbx  e  aaaaatCTaoi  aareacaaao* 
crbio  apa  nocroaaaoi  aapoaraoera  aoaiaoA  rpaaora.  cPuaereaaaaa  a  aacxTpona* 
Ki»  1968  M  8. 

165.  ’A  m  a  t  p  a  a  a  a  o  A.  H.,  K  o  p  a  A  o  B.  A.  Xapaaiepacraaa  o6aa- 
pyxeaia  aorepanaol  nan  aa  axnyabcoa  e  aaaaeraoA  aanaabaoA  +asoA  aa  pom  rayc- 
coaol  noiiexa  c  aaaaaaeraoA  aaraacmaocxMa.  (Puaoroxaaaa  a  aacarpoanaa*. 
1968,  Kb  9.  Tan  xo  crma  Tax  xa  aaropoa  06  oOaapyxaaaa  naaera  aaaaaacaim 

•"HaTRtTSSfTo  A.  H.,  K  apa  ao  B.  A.  Xapaarapacraaa  oOaapy* 
xcaaa  aoraparfraol  nanaa  auayabcoo  co  cayaaAaoA  aaaaabaoft  #aaoA  aa  foaa 
rayccoaoA  nouexa  c  aaaaaaeraoA  aaraacaaaaefwa.  (Paxaotaxtnaa  a  aaoarpoaaaa*. 
I9W  J4  12. 

*167.  K  o  p  a  A  o  B.  A.  00  oaraumaoM  oAwpyxraaa  caraaaoa  opaxtAcr* 
aaa  noucx  c  naasaaeratraa  aapaawtpawa.  .Puaoraxaaaa  a  aatarpoaaaa*,  I960, 

14  2. 

'  J68.  Jl  a  a  a  a  B.  P.,  Kyaup  A.  ♦.  Aeamnoraaacaa  oaTaMUbaue 
aaropanni  oOaapyxaaaa  a  aaaxaoeaaa  caraaaoa  aa  poaa  aotwa.  .Pwaoiaxaaaa 
a  aaearpoaaaa*,  1969,  Ml. 

169.  Jl  a  a  a  a  B.  P.,  K  y  ■  a  a  p  A.  9.  AcaMaroraaacaa  oaraaMbaua 
paaroawa  aaropanni  oOaapyxaaaa  caraaaoa  aa  pone  noucx.  (Puaoraxaaaa  a 
aaaarpoaaaa»,  1969,  M2. 

179.  Jl  a  a  a  a  B.  P.,  T  p  o  a  a  a  a  A  B.  B.  noaaaa  aapoaraocro  oxaO- 
aa  op*  aaaecapaaaana  aopnaabaux  coooayaaocraA,  paaxanaio xaxca  aearopana 
cpaaaax.  «&oraiMvata  a  raaaMaaaaaaa*.  1970,  M  I. 

171.  riAaiaaa  M.  10.  npamaaaaa  raopaa  arp  a  aaaoropMM  aatanaa 
aaroyapaaaaaaa.  (Aaitmaraaa  a  raatMexaaaaa*,  1963.  Alai. 

172.  Italian  51.  3.  Aatattmaa. oOynaana a cawooOyaaaaa a aaroMiTana- 
caas  cacmux.  «AaroMaraM  a  raaoaaaaaaax,  1966,  M  1. 

178.  r  y  t  a  a  a  Jl.  C-JTaopaa  rnmmmmn  m ram  pwaoraatM*  (paa- 
am  aOaopa  paaaaraa  raopaa  aa+opwaa  ■  COOP).  (Ttxaaaacaoa  aaOapatraaa», 
1988  14  8. 

’  174.  r.y  r  a  a  a  Jl.  C.  O  aaaoaa  ycraaooaaaaa  jacaaptaa  myut  aa  aura* 
At  eoraacooaaaoro  aaaataoro  paabtpa.  tHaaicraa  ayaaaa.  Puamaaarpoaaaa. 

1970,  M  2. 

176.  «  a  a  a  Jl.  M.  Taopaa  atpuna  Aacapamwx  cooOxaaal.  Htt-ao 
(Coaercaoa  paxno.,  1970. 

178.  B  o  a  b  ■  a  a  o  a  H.  A.  Buxtxamn  xrroaa  caraama  aa  myna.  Hax-ao 
(Coaercaoa  pax  no.,  1989. 

177.  n  a  e  r  p  a  a  o  a  B.  B.  ♦aaoawa  pttaoraxmnacaaa  camnw.  Hax-ao 
(Coaarcaoa  puaoa,  1968.  „  _  .  _  ..  _ 

178,  0  a  a  a  a  x  b  a  r  a  A  a  M.  n.  rpaOaanama  paxarpu.  Hax-ao  (Co* 
aarcaoa  paxaoa,  1966. 


179.  K  y  s  fc  m  a  ■  C.  3.  Un&poaaa  oOpaOoraa  paxROxoaatiRORRoA  RmpopMa. 
aaa.  Hax-ao  tCosereaoa  paxRoa,  1967. 

180.  Caacoiuao  C.  B.  LlH$poaue  mctoiw  onTHMixhHoA  oOpattorioi 
paxRMOKanaoRMix  camaxoi.  BoeRHsaiT,  1968. 

181.  Hipan  K>.  B.  O  XHHeARUX  caoAcraax  crctcmu  uiRpoRonaxocHuA 
orpaaaairTfcnk— 4aanp.  «PuRorexHRaa  h  MexrpoHaxo,  1962,  M  7. 

182.  B  o  p  o  a  a  a  JI.  <0.  Baexettae  a  Teopaio  noiwxoycToftaaaoro  Konupo- 
aaaaa.  Hsx-ao  «Coaercxoe  paxRba,  1968. 

183.  OlaeToaa  r.  A.  KoAnpoaann*  a  DOHcxoycToAaaaocTb  TexettexaHH- 
xecxoA  HwbopMaaaH.  Hax-ao  «3Heprax»,  1966. 

184.  CtpaTOHOin  P.  SI.  0  ueHHOcra  RR^opMaaaa.  cTexHaaecaan 

aatfepneTHRaa,  1965,  t.  5.  s. 

185.  Ill  e  a  a  o  a  K.  Patfoni  noTeopaa  Ht^opwaaaM  a  xa6epaeraac.  Hax-ao 
aaocrpaHRoA  xmcparypii,  1963. 

186.  Bapuaaap  B.  A.  K  cpaaacRaRt  cnocoOoa  npaeMa  caraajioa. 
•PaxaoTexHRxa*,  1962.  Ml. 

187.  Ql  a  p  o  a  A.  H.  HxeaxkHwA  npaeia  cnrAaxoa  orrraMMuroro  icoai 
aPuaoraXRaaaa,  1961,  14  9. 

188.  nectpaxoa  B.  B.  OnTH**axi>Roe  oOHapyxeaae  paxaocRrnaxoa, 
a.  I.  Hax.  M3HC,  1967. 

189.  71  e  t  p  o  a  r  a  H.  T.,  Paiaauat  M.  K.  Cacreim  canaa  c  mywo- 
noxoflHWMR  cRrRMBMa.  Hs*-ao  fCoaercROe  paxaot,  1969. 

190.  Poam  A.  <0.  lloporoaua  caoAcraa  a  ooTNMuaHin  napaaetpii  car- 
rljob  npa  paxaoRoaanHORHUX  aaiMpeaaax.  «PaxRor*xaBxa»,  1968.  M  6. 

191.  aoaia  A.  9».  Onenaa  XPCiotepBOCTB  napaxna  coo6u»rrA  npa 
aciKMuoaaaaa  aaaxoroiHX  mapoaonoxocHHX  caraaxoa.  «PuaoTexB«xa».  1970, 
M  5. 

192.  R  o  b  b  I  n  a  H.,  M'o  nro  $.  A  atocbaatie  approximation  method. 
Annalet  of  Math.  Statlatlca.  1951,  r.  22,  M  I. 

193.  K  1 « ( a  r  E.,  W  o  1  f  o  w  1 1  a- J.  Stochaatle  catlmation  of  the  maxi, 
mum  of  rennaaion  functiona.  Annalea  of  Math.  Statiatlca,  1952,  a.  23, 19  3. 

194.  niTapeoa  H.  Jl.  CTarHCTaaecaaA  aauaa  a  oaraiiaaaaaa  crctcm 
aBTOHaraaecKoro  ynpaaxeaan.  Hu-ao  «Coaercaoa  paxaot,  1964. 

IMS.  71  o  r  a  a  o  a  H.  B.  Mrroxu  croxaemacxoA  annpoxcanmaa.  «Aaro- 
Haraaa  a  Taxcnaxaaaxaa,  1966,  M  4. 

196.  K  o  p  a  x  o  B.  A.  06  oirrmauanoM  oORapyaccaaa  caraaxoa  Ha  <poHe 
nomex  c  aeaaaecrauMR  napaaerpaua  npa  orpaHRaeHRoft  aepoRTROcra  xoataoA  rpe- 
aora.  (Paxaoraxaaaa  a  axeareoaaxaa,  1970,  M  7. 

•  197.  B  o  x  a  m  a  a  o  a  H.  A.  a  xp.  Hexoropwe  aonpocw  cTaracTnecaoro 
eaama  aatopMaaaoaaux  cwm  aTaxRaaccaax  aa6apacTRaa».  1970,  M2. 

198.  II  u  a  a  a  a  fit.  3,  Ocaoau  Teopaa  o6yaammaxca  cacrea.  Hax-ao 
«Hayaa>,  1970. 


